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, Abstract. Interacting systems consisting of two rotators and a pendulum are consid- 

t-H ' ered, in a case in which the uncoupled systems have three very different characteristic 

time scales. The abundance of unstable quasi periodic motions in phase space is stud- 
ied via Lindstedt series. The result is a strong improvement, compared to our previous 
results, on the domain of validity of bounds that imply existence of invariant tori, large 
homoclinic angles, long heteroclinic chains and drift-diffusion in phase space. 

<n : 

^j- ■ §1. The Hamiltonian system. 

O ! 3 3 

1.1. Let (tp, ati, a.2) = (tp,ot) € T be three angles (i.e. positions on circles); let (I,Ai,A 2 ) = (-^,A) S ™ 
be their conjugate momenta (or "actions"). We consider the Hamiltonian function, depending on two 
parameters E,r) > 0, defined by 

f"TN ' A 2 I 2 

i n = ^n 1 A 1 +T 1 ^j+T 1 -^n 2 A2 + — + J a gl(co8ip-l)+sf{ip,a 1 ,a2), (1.1) 

with / an even trigonometric polynomial of degree N, Nq in a, tp respectively; fii, SI2, J, Jo> go are positive 
' O I constants. 

q ■ This system describes two rotators (one anisochronous, labeled #1, and one isochronous, labeled #2) 

interacting with a pendulum which has its free (i.e. with e = 0) unstable equilibrium position at / — 0, tp = 
■ and the stable one at J = 0, tp = 7r. The scale of frequency of the pendulum is 0(1) in 7/; at the same time 

the two rotators rotate at constant speed 0(772) and 0(77" 2) respectively. Hence the system has three time 
scales: we assume 77 < 1 so that the slow rotator is the #1 rotator. 

The free motion admits invariant tori of dimension 2 (namely parameterized by A, a constant vector, by a 
arbitrary, and with I = 0, (p = 0) which are unstable and possess stable (labeled +) and unstable (labeled 
— ) 3-dimensional manifolds (parameterized by A, the same constant vector, by a,tp arbitrary, and with 
I = ±J g y/2(l - cosy;)). 

We shall study properties that eventually hold when 77 — > 0. It is well known ([HMS,CG] for instance) that 
if e is small most of the unperturbed tori and their manifolds still exist, just a little deformed. This means 
that (under the condition stated below) there exist functions U^, (<p, a) and VT; (y, a ) which are divisible 

by e and analytic in a, ip, e, for a G T 2 , \ip\ < 2n, \e\ < eg, with £0 small enough, such that an initial datum 
starting on the (3-dimcnsional) surfaces W° , a = ±, defined as 



A CT (^a)=A' + UX,(^a), I°(<p, a) = ±J o9o ^2(1 - cos tp) + V£, (<p, a) , (1.2) 

evolves, when the time t — * ±00, tending to be confused with a quasi periodic motion on a invariant torus 
T(A'), with rotation vector 

w' = («i,a;i), wj =Vfii +77 J- 1 A[ , w'/^ V*^2 , (1.3) 
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and furthermore such asymptotic motion takes place with A moving quasi periodically with average A'. 
All this holds if u>' verifies the Diophantine condition 

\u' ■ u\ > C\v\- T , Vi/eZ 2 \{0}, (1.4) 

for C, t > (which may depend also on rj) . The values of e for which we shall be able to prove the above will 
be so small that the part of the stable and unstable manifolds with \ip\ < |-7r can be represented as a graph 
of A, I over a, tp. Hence we look, since the beginning, for invariant tori which have the latter property. 
The approach to the invariant tori, of the points that lie on their stable manifolds, will be exponential in 
the sense that their distances d(t) to the tori will be such that 

lim t _1 log d(i) _1 = g , g = g {e) = (1 + T(s,g Q )) g , (1.5) 

t — >+oo 

for a suitable analytic function r(e, go), divisible by e. We shall call g the Lyapunov exponent of the torus 
(it will depend on e as well as on the considered torus, i.e. on u/ and on r/). The exponent relative to the 
approach to the same torus along its unstable manifold (as t — > — oo) will be the same, by time reversal 
symmetry defined below. 

We fix throughout the paper r (r > 1) and we shall mainly study the dependence of e$, i.e. our estimate 
for the analyticity radius, as a function of n: e = £o(C,f]). 

1.2. Remark. The relation (1.3) between the value of the average action and the rotation vector is non 
trivial and it has been named in [G1,G2] (where it was pointed out) by saying that the tori of (1.1) are 
"torsion free" or "twistless". It is a remarkable symmetry property of (1.1), sec [Gl,Ge2,GGM3]. 

1.3. If e — the stable and unstable manifolds coincide (because the pendulum separatrix is degenerate); 
it is a degeneracy that is lost when e ^ and generically the manifolds will have only pairwise isolated 
trajectories in common, called homoclinic trajectories. 

Nevertheless time reversal symmetry and parity symmetry 2 hold for (1.1). If S* denotes the time evolution 
and the involution map i (composition of parity and time reversal) is defined by i((p,a,I,A) = (2tt — 
tp, —a, I, A), then iS* = S~ f i and there are relations between the stable and unstable manifolds that are 
preserved even for e ^ 0. Namely 

U+ (¥>,<*) =UX,(27r-¥>, -a), 
V£ (p, a) =V£(2tt -¥>,-£*), 

where care must be exercised because the manifolds contain two points over each a, cp. 3 Hence if ip ~ 7r the 
relations in (1.6) concern points that lie on different connected manifolds; to understand what happens one 
should try a drawing taking into account that the above representations are considered only for \tp\ < |7T. 
Looking at the manifolds at ip = ir, assuming their existence and that they are graphs above a, ip for 
\tp\ < |-7T, equations (1.6) imply that, fixed A', 

Q(a) = f U+, (n, a) -U^a) = -Q(-a) , (1.7) 
so that Q(0) = 0; but, in general, Q(a) ^ for a ^ 0. 

The function Q(a) is called the homoclinic splitting (or simply splitting) vector at ip = ir, and the determi- 
nant of the matrix with entries d ai Qj(0) (splitting matrix) is called the splitting. One can more generally 



Note that if s = they are graphs over a, tp for \ip\ smaller than any prefixed quantity < 2tt. 

2 The latter symmetry is due to the assumption of evenness of /. 

3 This is in fact already so for s = 0. 



27JVmjemtrel998 



2 



Draft #20 



consider Z_ = Z_(tp, a) = (Uj^, (tp, a.) — U^, (p, a), V^, (tp, a) — V^, (tp, a.)) which would be the splitting vector 
at p. Here and henceforth the vectors in will be denoted with an underlined letter (while the boldface is 
used for vectors in R 1 ); so far £ — 3, but shortly we shall consider £ > 3. The function Z_ can be written 
as the gradient of a generating function i.e. Z_ = (<9 V $, d a <&). This is a result due to Eliasson who points 
out that it follows immediately from the Lagrangian nature of the stable and unstable manifolds. It is a 
further symmetry property. 4 

The symmetry of (1.1) (hence the consequent oddncss of Q(a)) implies that there is one trajectory which 
swings through tp = tt when a is exactly 0: it tends to the same invariant torus as t — ► ±oo, provided the 
torus exists and its stable and unstable manifolds are graphs over a, p over an interval of tp greater than 

\tp\ < 7T. 

In this paper we prove the following result. 

1.4. Theorem. Given the Hamiltonian (1.1), given constants s,Sl > and given r\ > small enough, the 
following assertions hold. 

• There are invariant tori with rotation vectors u>' for all u>' verifying the Diophantine condition (1.4) with 
constant C = C(n) = Q,e~ S71 1/2 and \uj[\ £ [^QiT]? ,2Qirj?]. 

• Such tori exist for \s\ < eo — 0(rj 2 ) and for n small enough. 

• They can be parameterized by their average actions A'; the angular velocity is then given by the rotation 
vector lj' = (Oirp + nJ^ 1 A^, 2 ?7~2 ) (i.e. they are "twistless") and the Lyapunov exponents have the form 
g = (1 + r(e, go)) go, with T(e,go) analytic in e and divisible by e. 

• The parametric equations for such tori and for their stable and unstable manifolds ("whiskers") can be 
computed by a convergent perturbation series in powers of e around the unperturbed tori with the same 
rotation vector and their corresponding stable and unstable manifolds. 

• At the homoclinic intersection with tp — tt (existing by symmetry), between the stable manifold and the 

unstable manifold of each torus, the splitting is generically given by the Mel'nikov integral which is of order 
-i -- 

0(e 2 n~Pe~~ g « v 2 ), for e small enough, with [3 depending on the degree No in p of the perturbation f : one 
can take (3 = 2N — 1 and the asymptotic formula holds if \e\ < rf , ( — 2(iV + 3) and r\ is small enough. 

1.5. Remark. The novelty of the theorem is the "sharp" bound eo = 0(rj 2 ). If we "only" require 
eo = 0(r]2 + ) where |+ is any prefixed positive number > | the result is proved in [GGM3] (see also 
[CG] or [Ge2]). The improvement is made possible by the totally different technique used (with respect to 
[GGM3]): a technique that has interest in its own right and, we think, beyond the result itself. In fact the 
proof of the last assertion of the theorem is the content of [GGM2] , and the values of the constants (3 and 
C are taken from Appendix A2 of [GGM2] . 

1.6. Theorem 1.4 will be proved by a further extension of Eliasson's method, [E,G2,Gel,Ge2], for the 
KAM theorem. The following discussion will show the correctness of the intuition that "new" small divisors 
appear in the perturbation expansion at orders spaced by 0(?7 _1 ). So that the coupling constant is effectively 
0(e v ) and the analyticity condition is expected to be e v C(rf)~ q small (for some q > 0, determined as 
in the discussion in Remark 5.16, item (4), below). Hence the analyticity condition will be eC(n)~ qri small 
rather than the far worse eC(rf)^ q small, that is implied directly from lemma 1 in [CG] (where q — 6 is an 
estimate). 

In the one degree of freedom case the corresponding problem is studied in [N] : it is a problem that arises 
naturally in the context of Nekhoroshev theory. In our case the rotation vector is not one-dimensional, so 
that the cancellations between resonances typical of small divisors problems, [E,Gl,Gel,Ge2], have to be 
exploited in order to prove convergence of the perturbative series. The fact that the two components of the 
rotation vector (1.3) are so different in scale has the consequence that small divisors can appear only at 



4 It can alternatively be easily seen from the explicit expressions for the stable and unstable manifolds equations derived in 
[Gl], which also provide a general algorithm for constructing the function $ as a convergent series in e for e small; see [G3]. 
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high orders, so that the dependence of the radius convergence on the Diophantinc constant C(ji) is highly 
improvable with respect the "naive" one, as explained above: the proof of such an assertion is the subject 
of the present paper (as, in the weaker form, already of [GGM3]). 

1.7. The paper is organized as follows. In §2,3,4 the formalism is concisely illustrated and the graphic 
representations of the whiskers in terms of tree graphs is exhibited (for systems more general than (1.1); 
see (2.1) below). The analysis is brief but selfcontained, with references to [Gl,Gc2] only given for further 
insight and details. The basic formalism is in §2, then we work out in §3 two specific examples to explain 
the origin of the graphical interpretation, and in §4 we set up the general Feynman rules for evaluating the 
equations of the whiskers (and the splitting vector as a particular case) as a sum of quantities that can 
be elementarily evaluated. In §5 bounds are derived, assuring the convergence of the perturbative series 
defining the whiskers in the more general system in (2.1) below and leading to Theorem 1.4, when restricted 
to the Hamiltonian (1.1). 

The bounds are derived along the lines of [Gl,Ge2]: the main part is the derivation of the bounds for the 
part of the expansion corresponding to what we call the contributions due to "trees without leaves" : this is 
done fully and self consistently in §5 and in the related appendices. Once the bounds on the contributions 
from trees without leaves are established, which is the real difficulty, the same analysis can be applied to 
bound the other contributions. Since this is simply reduced, without any further technical problems, to 
the case of contributions from the simpler trees with no leaves we do not repeat this part of the discussion 
which is done in [Ge2] following the corresponding analysis done in [Gl,Gel]. 

To Appendix Al we relegate some technical details, while Appendix A3 concerns the cancellation analysis 
of [Ge2], needed in order to treat the small divisors problems, with more details with respect to the quoted 
paper. An original technical part is also in Appendix A2 and deals with the improvement of the dependence 
of the convergence radius on the Diophantinc constant C{rf). 

We do not comment here on the obvious relevance of the above results for the theory of Arnol'd diffusion: 
see [GGM3,GGM4]. 



§2. Lindstedt series for whiskered tori. 

We use the formalism of [Ge2] : it would be pointless to repeat here the technical work required to motivate 
the necessity or usefulness of the notations, and we cannot imagine that any reader may have interest 
in the matter that follows unless he has some experience with Eliasson's method, as exposed in [E] and 
complemented in [Gl,G2,Gel,Ge2]. The references to [Gl,Ge2] are given only to point at places where 
further details on the motivations of the assertions can be found. 

The following analysis innovates [Ge2] in §5 because of the extension of Siegel-Bryuno's bound described 
in Appendix A2 below: this section and the next two provide a self contained description of the graphical 
algorithm exploited in §5 and Appendix A2. 

2.1. In the following we shall consider a Hamiltonian ("Thirring model") more general than the one in 
(1.1), i.e. a Hamiltonian which couples a pendulum with £—1 rotators via a perturbation /i which is always 
an even trigonometric polynomial, 

1 I 2 

H = UJ - A + ' A + 2J- + Jo9 ° + eJo9 ° MP* a ) + Jog o T( £ '5o) M<P) > (2-1) 

where (a, A) el' 1 xl' 1 , (ip, I) e T 1 x R 1 , J > 0, J is a diagonal matrix, with < det J < +oo, and 

/i(p,a)= ]T ]T flS™^, fl=f\_, (2.2) 

M<«0 \f\<N 
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/„(¥>) =(«»¥>- 1)= £ f^^\ 

|n| = l 

with = (^o; 17 ) = ("j ") S Z x Z £ 1 and |z/| = l^jli we prefer to consider the Hamiltonian (2.1) 

with I arbitrary because the Lindstedt series analysis holds for any I > 1. So that the value £ = 3 and the 
existence of three time scales will be used only to obtain the second bound in (5.13) below. 
The last term in (2.1) could be put together with the free pendulum potential Jo5o ( cos ¥~ 1) tnus modifying 
the "gravity acceleration" g 2 into (1 + 7(5, go)) g^: the term with 7(5, go) — SfcLi lk(go)s k is added because 
we follow here the approach in [Ge2]. We show that, given s, 0, 77, u/, with ry small enough and w' verifying 
the Diophantinc condition in Theorem 1.4, then one can fix 7(5,50) so that, for |e| < 0(r] 2 ), there is an 
invariant torus with average (over time) action A', with the properties in Theorem 1.4 and with Lyapunov 
exponent go and rotation u/ = u> + J A'. In other words by adding a counterterm to the Hamiltonian 
(1.1) one gets a new Hamiltonian system, (2.1), with an invariant torus with rotation u>' and Lyapunov 
exponent exactly equal to the prefixed go (see also §2.7 below). 

We further show that, fixed u/, 7(5,50) is jointly analytic in 5,50, if 50 varies near a prefixed c/ > 0. 
Going back to the original Hamiltonian (1.1) we therefore set 5q = 5q (1 + 7(5, 5 )) and we can invert the 
latter relation as 5~o = (1 + r(e,5o))<?o f° r £ small enough (this will mean: for s| < 0(n 2 )). Hence by 
interpreting 50 in (1.1) as g 2 , (1 + 7(5,5,3)), so that (1.5) holds, we obtain Theorem 1.4 as a corollary of the 
above statements. 

Of course a similar proof could be done without first fixing the Lyapunov exponent g and then inverting the 
relation between the "dressed exponent" g Q and the "bare" one g . But it is well known, from the analogous 
problem in renormalization theory, that it is wiser technically and conceptually to work, in perturbation 
theory, with prefixed physical quantities (i.e. dressed ones). The idea that perturbation theory would be 
simpler, in the technical estimates, is the key idea beyond [Gl, Gel] that is introduced in [Ge2]. 

2.2. From now on let us denote by a the initial value of the rotators angles (i.e. at time t = 0). We 
define by XJ(t; a), j = 0, . . . , 21 — 1, the values of the variables at time t that are reached from initial data 
X a (0;a) = (tt, a, I a (0; 0),A CT (0; 0)), with the given a, with tp = n and with I, A such that X CT (0;a) is 
on the stable (a = +) or unstable (a = — ) manifolds of the invariant torus that we are searching for; the 
convention on the labels of X is that 

XS = <p°; XJ = a°, forl<. 7 <£; 
XI - I a ■ XJ = A° , for e < j < 21 . 

All functions in (2.3) depend on t and a (the symbols I a (t;a) and A a (t;a) should not be confused with 
I a (ip,a) and A a (ip, a) defined in (1.2): in the following no ambiguity can arise as the quantities I a and 
A CT will be used always with the meaning in (2.3) and as functions of t, a). 

This is a parameterization of the stable and unstable manifolds in terms of a, t where a. is the value of the 
angular coordinates at the moment in which <p = ir, and t is the time elapsed since. The parameterization is 
different from the one in terms of a, ip in (1.2) unless, of course, it is ip = tt and correspondingly t = 0. Hence 
the splitting vector (1.7) at <p — tt can also be written Qj(at) = X^(0; a) — XJ(Q, a), j = I + 1, . . . , 21 — 1. 
Note that we do not need to consider explicitly the splitting in the /-coordinates because, by energy 
conservation, they are functions of ip, a, A . 

Let A' be given and let u>' in (1.3) be Diophantine with constants C — C(n),T > 0; see (1.4). We look 
for an invariant torus and for its stable and unstable manifolds with the property that the quasi periodic 
rotation on the torus takes place at velocity <*/ and, at the same time, the action variables oscillate with an 
average position A'. 

• Before proceeding we remark that the above two requirements may seem contradictory as there may seem 
to be no reason for being able to prescribe simultaneously the "spectrum" us' and the "average action" A' 
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of the invariant tori. In fact this property of "twistless" motion on the tori or of "absence of torsion" is very 
remarkable (see the Remark 1.2 and [Gl]): it will appear as due to the special symmetries of the system 
(2.1) and to the separation of the energy into a quadratic part involving actions only and an angular part 
involving only the angles. 

Note also that we could confine ourselves to study the torus with average position A' = 0, as in [Gl,Ge2], 
because any torus can be reduced to that one through a trivial canonical transformation (a translation in 
the action variables). This explains why in the quoted papers only the torus covered with rotation vector u> 
is explicitly considered: however in the following we consider also A' ^ 0, as we are interested in showing 
the abundance of such tori in phase space (see the Remark 1.5). 

The quantity XJ(t; a) can be graphically represented as sum of values which can be associated with tree 
graphs, that we shall call "Feynman graphs" or "trees" tout court, see Fig. (2.4) below. The trees are partially 
ordered sets of points, called nodes, connected by unit lines, called branches, and they are "oriented" towards 
a point called root, which is reached by a single branch of the tree. Given two nodes v and w of a tree, we 
say that w precedes v (w < v) if there is a path connecting w to v, oriented from w to v. With an abuse 
of notations we shall sometimes consider a tree as the collection of its nodes, sometimes as the collection of 
its branches and sometimes as the collection of both nodes and branches. The root will not be considered 
a node. 

A typical tree considered below can be drawn as in Fig. (2.4): the labels meaning and the caption of such a 
drawing (which has to be interpreted as a mathematical formula) will be elucidated in the coming sections. 



root 




A tree i9 with m = 12, and some labels. The line numbers, distinguishing the lines, and their orientation pointing at the root, 
are not shown. The lines length should be the same but it is drawn of arbitrary size. The nodes labels S v are indicated only 
for two nodes. 

The branch starting at the node v and linking it to the uniquely determined next node (or to the root), 
which we call v' , will be denoted by A„: there is a unique correspondence between nodes and branches 
starting at them. We shall say that A„ exits from v and enters v'; given a node v we shall say that a branch 
A pertains to v if either A enters v or A exits from v; e.g. in Fig. (2.4) the line v\Vo = X Vl "exits" v\ and 
"enters" vq, hence it pertains to both. 

In [Gl] two expansions are considered for the functions Xj(t;a) representing the stable and unstable 
manifolds: one of them is used to exhibit cancellations taking place at all orders in the sums that express 
the coefficients of the power series in e of the splitting vector, [G1,BCG,GGM2]; it is somewhat more 
involved than the other one that is convenient to just discuss convergence of the perturbation series for the 
splitting vector and that we shall use here. This is the reason why (as in [Ge2]) we shall not have trees 
whose lowest nodes carry a graphical decoration called form factor, or fruit in [G1,GGM2]. Nevertheless 
some of the nodes will still have a particular structure: to characterize them we introduce, below as in [Ge2], 
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the notion of "leaf" , which is related to the notion of fruit in [Gl], from which it differs (and it, even, differs 
slightly from the similar notion of leaf in [Ge2]), see below for the motivation of the name. 

2.3. As mentioned the drawing Fig. (2.4) has to be regarded as a mathematical formula expressing a 
function of the labels and of the topological structure of the trees. We now prepare the notation for the 
definition of "value" of a tree (following [Ge2]) (see [Gl] for a simpler case): the derivation is not difficult 
but somewhat long and unusual for the subject (the breakthrough work [E] still does not seem to be well 
known in its technical aspects!). We discuss it in detail not only for completeness but in the attempt 
to clarify a construction that has generated quite a few new results starting from the work of [E], see 
[G1,GGM2,BGGM]. 

Let us consider the unperturbed motion X (i) = (ip (t),a + u)'t, I (t),A'), where (tp (t),I (t)) is the 
separatrix motion, generated by the pendulum in (2.1) starting at t = in ip = n, A = A', I = — 2Jo50j 
so that (p a (t) = 4arctane~ 90 *. Let X a (t;a), a = signt = ±, be the evolution, under the flow generated by 
(1.1), of the point on W° which at time t — is (it, a, / CT (a, tt), A a (a, nj), see (1.2); let 

X a (t) = X a {t-a) = ^Tx ha (t-a.)e h = X ha {t)e h , a = ±, (2.5) 

h>0 h>0 

be the power series in e of X a , (which we want to show to be convergent for e small); note that X 0a = X° 
is the unperturbed whisker. We shall often omit writing explicitly the a variable among the arguments of 
various a-dependent functions, to simplify the notations, and we shall regard the two functions X h<T (t), as 
forming a sing le function X h (t), which is X h+ (t) if a = +, t > 0, and X h -(t) if a = -, t < 0. 
Components of X will be labeled j, j — 0,...,2£— 1, consistently with (2.3), with the convention that 

X d = A_ describes the coordinate ip, (X,)j = i ; ...^_i d = X| describes the a coordinates, Xg d = X + describes 

def 

the / coordinate and (A J )j = £ + i ! ..^2£-i = X-f describes the A coordinates, 

X = f {X 3 ) 1=Q _ 2 ^ (X_,X X ,X + ,X T ) , (2.6) 

i.e. we write first the angle and then the action components, first the pendulum and then the rotators. The 
t ("up") and i ("down") labels recall that the components with labels j (0 < j < £) have "lower" index 
than the variables with labels T (I < j), which have a "higher" index (a mncmonically useful fact, on first 
reading at least). 

Inserting (2.5) into the Hamilton equation associated with (2.1) we get that the coefficients X ha (t), h>l, 
satisfy the hierarchy of linear equations 

j t X ha {t) = L(t)X ha {t) + F ha (t) , (2.7) 

with F ha (t) a 2^-vector and the 2i x 2£-matrix L(t) is 



L{t) 



( J " ] 



ll J cos (p°(t) 

V o o o 




(2.8) 



For instance, F la (t) is a 2£-vector with the first 0, ...,£— 1 components vanishing (a consequence of 
the assumption that the perturbation only depends on the angular variables), with the ^-th component 
equal to —jQgQd v fi(ip°(t),a + uj't) + Jo5o7i(.9o) sm.(ip°(t)) and with the remaining components equal to 
-JoSo0a/i(¥>°(t),a + w't). 

In general F ha depends upon X°, . . . , X h ~ la but not on X ha . The entries of the (2£ x 2£) matrix L have 
different meaning according to their position: the O's in the first and third row are (£ — l)-(row)-vectors, 
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the 0's in the first and third column are (£ — l)-(column)-vectors, and the O's and J -1 in the second and 
fourth column are (£ — 1) x (£ — l)-matrices, while the O's in the first and third columns are scalars (as J^ 1 
is). The perturbed motions will be described by dimensionless quantities S, $: 

Xf CT =£f, 0<j<£-l, X^=J g E^, £<j<2£-l, 
F'f CT = .hgl 4>>^ , F$° = J g 2 $ h + ° , 



The simple form of the Hamiltonian equations for ip,a, namely ip — J 1 I, a = u> + J 1 A implies that 
®j a = F j ka = 0, for j = 0, . . . , f - 1. For instance 

$ lCT = (0,0,-^/ 1 (^°(i),a + o;'i)+7i(3o)sin(^oW),-5 Q /i(^ (t),a+a;'i)) . (2.10) 

Given the form of L(t) and the vanishing of the first £ components , of F ha , for h > 1, the above 
hierarchy of equations (determining the stable and unstable manifolds) takes the form 

L d r^ha ,.0 r^ho , xJio ^ d ~ho ff.Ua 

5o dt + 5 o 1 

Id Id [ } 

And, for all h > 1, we can easily write (via Taylor expansion and order matching) the following formula for 
in terms of the coefficients S°, . . . , 5' 1_1<J and of the derivatives of /o and /i = /, sec (2.2). The first £ 
components of <fr hcr vanish, as said above, = 0, <&^' J = 0, and 

£-1 mi 

*y=- e (9«AUv o , a + a,'*) e n n . 

|m|>0 WV.k-iPl'H 



m 



$f ee - ^ (d^ (<p)u<p°) yi n- ff + ( 2 - 12 ) 

-EE>w(VoM) H (/) E II 3 - CT + 

P=l|m|>0 (hJ)m,h- P J = 1 

£-1 m; 

- E(vi)m(v o .«+«'*) e nn^ 

H>0 (hpm,h-ii=0j' =1 

where (G)m(-), with G e {d v fo,d a fi,d v fi}, and (hj)rn, q , with /i*. > 1, are defined as 

,f)rn f)mi a m e-i sy 



— V molmi! ... m£_i! / 



£—1 mi 

(/ij)™,, EE (hi . . . ,h° mo ,h{, . . . , fc mi , . . . , h{~\ With ^ ]T ft* 

»=o j=i 



(2-13) 



and mi > 0, m = (mo, . . . , m^_i), |m| = X^i=o m «- Note that the first two sums in the expression for <J>^ CT 
can only involve vectors m with rrij = if j > 1 (so that |m| = m ), because the function / , see (2.2), 
depends only on </? and not on a. The evolution of S' 1 is determined by integrating (2.8), if the initial data 
are known. The h — 1 case requires a suitable interpretation of the symbols, given explicitly by (2.10). 
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Elementary quadrability of the free pendulum equations on the separatrix leads to the following expression 
for the "Wronskian matrix W(t) of the separatrix motion for the pendulum appearing in (2.1), with initial 
data at t — given by <p = ir, I — —2g a J a , i.e. = —2. The matrix 

w{t) = (woo(t) w e(t)\ (214) 



wto(t) w u (t) 

is defined to be the solution of the linearization of the free pendulum equation around the separatrix solution, 
with data W(0) = 1 and with J = 1 (because we use dimensionless solutions S, see (2.11)): 

/ 1 w(t) \ 



cosh 2 got I 4 cosh 2 got 



J cosh g t J cosh 50 ^ 



The evolution of the /, <p components, i.e. ^ a with j = 0,£ (also identified with the components with 
subscripts ±, see (2.6)) can be determined from W(t), by integrating (2.7) for the and £ components, to 
be 

' -^ha \ / n \ rsot 



W(t) ( s ^° (0) ) [J W ~^ T ) ($f(r)) d9 ° T ' (2 ' 16) 

Thus, denoting by Wij — 0,£) the entries of W(t), (2.16) becomes, for h>l, 



, r9oi , /■Sol 

E h _ a (t) = w oe (t)(Z h + ° (0) + j Woo (r)$f (r) dg r) - w m {t) J w m (t)^ (t) dg r , 

/ /-Sot . rgot 

E h /(t)=w u (t)[E h /(0) + J w oa (r)^(T)dg r) - w m (t) J w M {r)$ h + ° (r) d g r , 



(2.17) 



having used that S_' CT (0) = because the initial datum for tp is fixed and £-independent. Likewise integration 
of the equations (2.11) for the |, 1 components yields, for h > 1, 

rfo* s /-so* 



r / /"Sot . /-got 

Sf CT (i) = J' 1 Jo [.g i(sf CT (0) + jf dg r) - J g r^ CT (r) dg T 

/ r 9ot \ 

B*°(t) = (s^(0) + J **"(r) dg r) , 



having used that the S|" T (0) = because the initial datum for a is fixed and e-independent. The equations 
(2.17), (2.18) can be used to find a reasonably simple algorithm to represent the whiskers equations to all 
orders h > 1 of the perturbation expansion. 

2.4. The initial data in (2.17), (2.18) have to be determined by imposing that the solutions (to all orders) 
become quasi periodic as t —* a oo. This is quite easy and (as to be expected) this condition is simply that 
S^_ cr (0), 3^(0) are determined by imposing that the integrals in parentheses become integrals between a oo 

and t, i.e. (0) = j° aao ... and 3^(0) =J° 00 ...; see below. 

However the latter integrals arc no longer necessarily convergent properly (a few examples suffice to see 
this); hence one has to go carefully through the process of imposing the correct asymptotic behavior in order 
to see what is the meaning to be given to such integrals J* . The analysis can be found in [Gl, Ge2]. The 
result is that all expressions under integral sign can be written as sums of functions that are rather special, 
namely 

M(t) = a x fa *) 3 e *>'-vt-pgo*t ; ( 2 .19) 
j! 
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with \, j, u,p integers and p > — 1 (see below), so that one has 

oo oo 

~ ha (t) = X) Z ha (is,p)e i "'- ut - p9 °' Tt , <S> ha (t)= Y X $> h < y (v,p)e lU3 '- vt - p9a,Jt , (2.20) 

where we explicitly write down only the dependence on v and p (clearly also the fixed constants like 
J, J , g , . ■ . enter). 

The series turn out to be convergent for at > 0; however their sums have no singularity at t = and can 
be anaytically continued for at < (i.e. x > 1). More precisely the functions that one has to integrate are 
contained in an algebra M. on which the integration operations that we need can be given a meaning. 

Definition ([Gl]). Let M be the space of the functions of t which can be represented, for some k > 0, as 



k 



M(t) = M°(x,LJt) , x = e- a9at , a = signt , (2.21) 

3=0 J - 

with Mj(x,ij>) a trigonometric polynomial in if) with coefficients holomorphic in the x-plane in the annulus 
< \x\ < 1, with possible singularities, outside the open unit disk, in a closed cone centered at the origin, 
with axis of symmetry on the imaginary axis and half opening < and possible polar singularities at x = 0. 
The smallest cone containing the singularities will be called the singularity cone of M. 



raot 

(TOO 

f 9 ^ M(r)dgoT, is simply the residuum at R = of the analytic function 



The proper interpretation of the improper integrals J 90 M(r)dgoT, which henceforth will be denoted by 

ot 

CTOO 



IrM = / e- R3aaz M(z)dgoz , (2.22) 

J aoo+i6 

(where 9 is arbitrarily prefixed) which is defined and holomorphic for Re R > and large enough, i. e. 

got 

IMit) ee j dgor M(t) = f j ^ l R M{t) . (2.23) 



CTOO 



By linear extension this defines the integration of function in A4 for \x\ < 1. The analyticity in x around 
x = ±1 and the remarks that -^lM(t) ee M(t), i.e. TM(t) = IM(i') + J^°* dg r M(r), so that lM(t) is 
a special primitive of M(t) (at fixed a), allow us to analytically continue the result of the integration to a 
function in M . The operator I maps the algebra M into itself because one checks that on the monomial 
(2.19) one has 

[9o a (7+i)! ■ otherwise , 

showing, in particular, that the radius of convergence in x of TM, for a general M, is the same as that of 
M. But in general the singularities will not be polar, even when those of the MJ's were such. 
We shall see that the cases \p\ + \v\ = do not enter in the discussion (a feature of the method of [Ge2]). 
The complete expression of X ha (t) becomes 

Z h _°(t) = w oe (t)l(w 00 ^)(t) - w o(t)(l(w oe <P h + °)(t)-l(woi<P h + °W)) = f '0(*?)(t) , 

S^(t) = J- 1 J (x 2 (4>^)(t) - ^(^YW)) d = Z 2 (&i a (t)) , (2-25) 

(t) = w u (t)J(woo<P h + a )(t) - w m (t)(l{w M $ h + °){t) - l(woi^W)) d =0+(^)(t) , 
S^(t)=l(^)(t), 
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where O, + ,I are implicitly defined here (and l 2 is 1 applied twice); and ^ ho ', <& ha = (0, 0, #^ CT ) are 

introduced in (2.9). While 'E J ha has non zero components over both the angle (j — 0, ...,£— 1) and over the 
action (j = £,... ,2£ — 1) components, the <f>' iCT has, as already noted, only the action directions non zero; 
the notation CT means the limit as t — > from the left (a = — ) or from the right (a = +), but below we 
shall drop the superscript on (always clear from he context because it is the same as the superscript a of 
the functions 'El 1 ' 7 ). Furthermore, with the definitions (2.20) of 4»j (v,p) one finds also the property (with 
the notations in (2.1)) 

<2^ CT (0,0) = , (2.26) 

for all h>l. 

We shall repeatedly use that in order to compute Ej 17 we only need Ey a with < j' < i (i.e. only 
S| CT ) and hi < h. This follows from (2.25) and (2.12): whether we want to compute an "action 
component" (S^" 7 , j > I) or an " angle component' (S^" 7 , j < £) of E ha , we only need the angle components 
of lower orders, i.e. a with hi < h and j' < £. 

2.5. The linearity of the last of (2.25), together with (2.26) and the ^-dependence of & ha (t) in (2.20), 
implies that the prefixed value A' has the interpretation of average action of the quasi periodic motion on 
the invariant torus to which the trajectories that we study asymptote; see the third statement in Theorem 
1.4. This corresponds to the identity of [CG] (see, in the latter reference, the first of (6.34) and its proof in 
Appendix A12) that follows from the symplectic structure of the equations of motion, according to a well 
known argument going back to Poincare, [P], discussed also in [E,CZ]. It is a property that generated the 
qualification of "twistless tori" given in [Gl] to such tori: the "dispersion relation" linking the frequencies 
to the average actions does not change or is not twisted when the perturbation is switched on. This is a 
property, established in the present context in (33) of [Ge2] , that can be ultimately traced back to the fact 
that in the above models the twist condition is not needed for establishing a KAM theorem. 

2.6. By combining (2.25) and (2.12), (2.13) (and recalling (2.9)) the representation in terms of trees is 
immediate; the integrals in (2.25) and the lower order X h in (2.12) become recursively multiple (improper) 
integrals over dummy "time" variables. 

In this operation each function (—d a fi(ip°(t),a + uj't))m and (—d<pfo(ip°(t)))m is expanded as a linear 
combination of monomials M(t) having the form a x (agoty (j\)~ 1 x 1l e l " vt with x = e~ 9a<7t ; see (2.19). 

The form of (2.12) shows that the integrations occur in a hierarchical order: hence one can describe them 
by a tree. The integrands can be identified by attaching to each node of the tree suitably many labels. We 
shall first illustrate the construction of the trees via two examples (in §3 below) : this can be useful in order 
to understand the general case (see also [Gl,Ge2]). 

2.7. We shall establish, also recursively, that E ha will be expanded in monomial like (2.19) with j = 
and p > 0, so that at t — > ±oo the quantities E ha will approach exponentially fast quasi periodic functions 
describing the motion on the invariant torus. The approach will be proportional to e~ 90 '*' or to a higher 
power of this quantity. This, together with the remark that at order (i.e. on the unperturbed motion) the 
approach is precisely proportional to e~ 90 ^ (in the I,tp coordinates), will imply that at least for j = 0, £ 
(and "generically" also for the other coordinates) 

km -iloglSJWr^so, (2-27) 

t^oo at 

i.e. that the Lyapunov exponents of the torus are ±go- 

Before stating the general graphical rules to represent (2.25) in terms of explicitly performed integrals, we 
discuss in detail two examples: understanding them facilitates enormously, we think, the understanding of 
the general cases which will be exposed referring to the examples to make it more concrete. 
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§3. Two examples of the trees construction. 

3.1. We discuss how to make more explicit (2.25) by performing two "third order" examples. The first 
order reduces trivially to the first order formulae (Mcl'nikov integral); the second order is also a bit too 
simple and is left to the reader: the first two orders will be, of course, implicitly done below, because to 
compute the third order one needs the first and second too. 

To third order the last line in (2.25) gives S^ a (t) = T(^ a )(t), where ^ a can be expressed through the 
first equation in (2.12), so that, for j = I + 1, . . . , 21 — 1, one has 

e-i 

(3.1) 



j e-i e-i 
- E d a ] d ap d aq fiE 1 p a E 1 q ' J - E 9 " 3 5 "p/i S p' T 



p,q=l p=l 



where S lcr and S 2cr can be written by using once more (2.25) (the first two lines only as per the general 
remark in the last paragraph of §2.4). 

We consider explicitly two contributions to S^ a {t). Recalling that a = + corresponds to the stable manifold 
and a = — to the unstable one, the first will be 



got 

^-f dg a T Vo (-d ajapa Ji((p°(T Vo ),a + u't Vq ) E 1 / (t Vo ) E 1 ^ (t Vo ) , (3.2) 



arising from the fourth contribution in the r.h.s. of (3.1). The contribution (3.2) can be written more 
explicitly, by using again the expression for S'|' T in (2.25), as 



got 

i j dg T Vo {-d ajapaq fi) (t„ ) T?{-d ap fi{T Vl )){T V0 ) ^ (-d ap .fi{T v J){T Vo ) , (3.3) 



where the X operations involve, see (2.25), integrations over variables that we can call t Vi ,t V2 and the 
derivatives of /i are evaluated at (ip°(T Vn ), a + Lj'T Vn ), n = 0,1,2. Such variables have been indicated 
explicitly using the abbreviated notation (r Vn ) and with a obvious abuses of notation (they should not 
appear at all, except t Vo , being dummy). 

The second example is obtained by considering the contribution with h® = 2 from the first line of (2.12), 
i.e. the second contribution in the r.h.s. of (3.1), 



got 



d ajV fi) (t Vq ) z?f{T Va )dT Vo , (3.4) 



still imagining the derivatives of /i evaluated at ((p°(T Vo ),a + lj't Vo ). This contribution will be the sum 
of several terms, because E 2 f(T VQ ) has to be expressed by using (2.25) and (2.12). One of the (many) 
contributions will be 

gat 

\ j dg Q r Va {-d ajV h) (t V0 ) o( - d*f (T Vl )O( - d v h(T V2 ))(T Vl )0( - d v h{T V3 )){T Vl )){T VQ ) , (3.5) 
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where the O operations involve, see (2.25), integrations over variables that we can call t Vi ,t V2 ,t V3 and the 
derivatives of /o, /i are evaluated at (<fi°(T Vn ), a + u)'t Vti ), n = 0, 1, 2, 3. Such variables have been indicated 
explicitly with the same abuse of notation as above; and the dependence on r of the derivatives of / , /i has 
again been simply denoted by adding the symbol (r Vn ) instead of the full argument (tp°(T Vrl ), a + uj'r Vn ). 
A complete representation of the above two contributions to Ej(t) is given, with enormous notational 
simplification, by the following trees: 




where the labels on the nodes v are denoted 5 V , j v and those on the lines A„ are denoted j\ v . 
The label 6 V = 0, 1 on the node v indicates selection of fs v , i.e. of /o or f\, the label j v denotes a derivative 
with respect to (p if j v = I or with respect to atj v if j v = £ + 1, . . . , 2£ — 1. For the label j\ v associated 
with the branch A„ following v, one has j\ v = j v — £ for all v except for the highest node vo, for which 
one has j\ v = j Vo . In the examples above, (3.3) and (3.5) correspond, respectively, to the first figure in 
(3.6) with j Vo = j,j Vl = p + £, j V2 = q + £ and to the second with j Vl = j V2 = j V3 = £,j Vo = j, (hence 
h vi = j\ V2 = h V3 = 0, jv = j)- In the examples the labels p, q correspond to d ap ,d aq in (3.3). 

3.2. Remark. The exception for the meaning of j\ VQ is convenient, in the above cases, as the integration 
over t Vq differs from the others: the inner ones evaluate E^ CT for j = 0, ...,£— 1, because the functions /o, /i 
only depend on the angle variables (see the last paragraph in §2.4); the last integral, however, evaluates in 
the examples a component of a^ a (which is labeled j = £+ 1, . . . , 2£— 1), but, in general, j can be any value 
j = 0, . . . , 2£— 1. Note that this is not so for the inner labels j\ which must be angle labels j\ = 0, ...,£— 1. 
So, in general, we shall have that the value of a tree with j\ vo = j contributes to S^ CT . 



§4. Trees and Feynman graphs approach to whiskers construction: the general 
case. 

We now proceed to describe the general case. 

4.1. To compute the splitting vector we only need to consider the variable t equal to 0. However we shall 
be also interested in E ha (a 7 t) for at > 0, for instance in order to study how fast the invariant torus is 
approached by the motions on its stable and unstable manifolds (to obtain its Lyapunov exponent). Hence 
it will be natural to attribute the label t to the root: this will also remind that the integral over t Vo has 
to be performed between troo and t, (the value a = — corresponds to the unstable manifold and the value 
a = + corresponds to the stable one). Since we shall never consider the stable manifold for t > or the 
unstable for t < the value of a will be the same as that of the sign of t. 

We shall be interested in computing not only (0; a) — A' (or (t; a) — A'), as in [GGM2], but, more 
generally, X a (t- 1 a.) — X°(t; a), with a — signi, (here X° denotes the unperturbed motion). 

In general the rules to express X a (t; a) — X°(t; a) as sum of "values" associated with trees will be described 
now, assuming that the reader follows us by applying and checking them to the special cases (3.3), (3.5), 
illustrated in (3.6). 

The reader might be helped in following the construction of the algorithm to express the stable and unstable 
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manifolds below, by keeping in mind that we simply decompose the (quite involved and recursively defined 
by (2. 25), (2. 12)) expressions for the whiskers, so far obtained, further. 

The purpose being of reducing their evaluation to very elementary algebraic operations: ultimately just 
products of simple factors associated with the nodes (and their labels) of a tree, that we shall call "coupling 
constants", and of factors associated with the branches (and their labels), that we shall call "propagators", 
each of which can be trivially evaluated and trivially bounded. 

The reader familiar with Quantum Field Theory will realize the striking analogy between the algorithms 
discussed below and the Feynman graphs: in fact a "tree" will turn out as an analog to a (loopless) Feynman 
graph and very likely it is a Feynman graph of a suitable (non trivial) field theory. Our analysis amounts 
to a renormalization group analysis of it and it partially extends, to the case of the theory of the stable and 
unstable manifolds of hyperbolic tori in nearly integrale systems, the field theoretic interpretation already 
discussed in detail in previous works, see [GGM1] and appended references, in the study of KAM tori. 

• To each node we attach an order label S v = 0, 1, see Fig. (3. 6), and a corresponding function fg v : if a node 
v bears a label S v = 1 the associated function is /i and if it bears a label S v — it is /o- 

• To each node v of a tree see the Figure (2.4) above, we associate an integration time variable t v and an 
integration operation, which corresponds to X 2 or O if the node is not the highest node vo and to I 2 or O 
or 1 or + if the node v is the highest, i.e. v = v . This is so because in the first case (a "lower node") one 
must use the first two equations in (2.25) because in (2.12) only angle components of X ha appear, while in 
the second case (that of the highest node) one can use all of (2.25) since we can evaluate either an angle 
coordinate S^ cr (a, t), j < £, or an action coordinate, j > £. 

When v < v n the choice between the two possibilities will be marked by an action label j v associated with 
each node: if j v = £, v < vq, then we choose O, if j v = I + 1, . . . , 21 — 1, v < vo, we choose I 2 . 

• When v is the highest node v , there arc therefore more possibilities: to distinguish between them we use 
the action label j vo and the branch label j\ VQ , which can be equal either to j vo or to j vo — I. So when v = v a 
and j Vo — £, we choose O if j\ v = j Vo — £ = and + if j\ v = j Vo = £, sec (2.25), while when v = v n and 

2 

j Vo > £, we choose 1 if j A „ = jv - £ and T if j XvQ = j Vo , sec (2.25). 
As said in Remark 3.2, the meaning of the branch label is that a tree with j\ = j is a graphic representation 
of a "contribution" to S^ CT . Therefore if j\ > £ we call the branch an action branch and if j\ < £ we call 
it an angle branch. 

In the first of the figures in (3.6) integrals with respect to the nodes vi,v 2 arc of the type J 2 . In the second 
the integrals over the T„ n , n = 1,2,3, are all of the type O. In both cases the integrals over t vo are of the 
form X because we fixed j\ vo = j > £ to be an action label. We can associate a branch label j\ v also to 
the inner branches with v < v n : however in this case one has necessarily j\ v = j v — £ because the inner 
branches necessarily represent angle components E^ CT with j < £, see (2.12). Hence no information is carried 
by such labels that we define only for uniformity of notation. The latter labels appear in (3.6) as j,p, q in 
the first tree and as j, 0, 0, in the second one. The labels j\ corresponding to the lines pertaining to a 
node v determine, as in the examples of §3, which derivatives have to be taken of the function fs v which is 
associated with v. each line A„ with label j\ v corresponds to a derivative of fg v with respect to ip if j\ v — 
or to ctj Xv if < j\ v < £. 

• The integrations over the node times t v must be thought of as improper integrals, in the above sense, from 
(too to either t v i or because (2.25) contains various integrals between such extremes. It will be convenient 
to distinguish between such terms. 

This can be easily done by adding, on each node, a new label p v also equal to 0, 1: if p v = 1 this means, 
naturally, that in the evaluation of the integration operations relative to the node v we select the terms that 
correspond to integrations between croo and tv while if p v = we select the integrations between ctoo and 
0. We shall imagine that also the highest node carries a label p VQ which is 1 necessarily if we consider only 
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(because this implies that the function associated with the highest node must appear differentiated 
with respect to a a-component, see above and (2.12)), but which could be for the other components of 
S(i). Recall also that in this case t v > = t, see (2.25). 

• We remark that the hierarchical structure of the integrations implies that if p v — 1 and if v' is the node 
immediately following (in the direction of the root) v along the tree then one has t v > t v > if a = + and 
t v < t v i if a = — , while t v ,t v > have the same sign but are otherwise unrelated if p v = 0; see (2.25) and 
check this in the examples. 

Besides the labels already introduced also the labels p v = 0, 1, just described but not shown in (3.6), should 
be imagined carried by each node. 

• Given a tree labeled as above we pick up the nodes v with p v = which are closest to the root, and 
consider the subtrees having such nodes as highest nodes. We call each such subtree, i.e. each such node 
together with the subtrees ending in it (and its labels), a lea}. 5 The name is natural if one imagines to enclose 
the part of the tree including the node v itself and half of the line A„ into a circle (or, more pictorially, into 
a leaf shaped contour): hence, to whom tries the drawing, it will look like the venations of a leaf and the 
half line outside it will look like its stalk. 

• All nodes which do not belong to any leaves will be called free nodes; they carry, by construction, a label 
p v = 1, so that the corresponding time variables are hierarchically ordered from the lowest nodes up to the 
root: i.e. if w < v then t w < t v if a = + and t w > t v if a = — . Given a tree "3 let us call df the set of free 
nodes in ■&, and call 6l the set of highest nodes of the leaves. 

• Each fs v function, associated with the node v with order label S v , can be decomposed into its Fourier 
harmonics. This can be done graphically by adding to each node v a mode label v_ v = (vq v ,v v ) = (n v ,v v ) E 
Z , with \v v \ < N and \n v \ < Nq, that denotes the particular harmonic selected for the node v. If 
(jv,S v , p v ,E v ) are the labels of v we will associate with v the quantity e i{"- v vT V +n v y (t„)) mu ltipHed by 
appropriate products of factors in v (one per (^-derivative) and iv v j (one per aj-derivative, j = j\ v ). 

If the mode labels v v are specified for each v we shall define the momentum u{v) "flowing" on a branch A„ 
as the sum of all the angle mode components v w of the nodes w preceding the branch, with v included, 

v{v) d M Y, "">■> 

the momentum v(vo) flowing through the root branch will be called the total momentum (of the tree). 
We shall define also the total free momentum of the tree as the sum of the mode labels of the free nodes: 
more generally, for any free node v we can define the free momentum flowing through the branch X v as 

For instance in the above examples the two contributions (3.3), (3.5) (represented by figure (3.6)) are 
decomposed into sums of several distinct contributions once the p v and the mode labels v_ v are specified. 
Likewise we can look at a leaf as a tree: the momentum u 1 flowing through its stalk will then be called the 
internal leaf momentum. Note that its value gives no contribution to the total free momentum of the tree 
to which the leaf belongs. 

• The free momenta will turn out to describe the harmonics of the time dependent quasi periodic motion 
around the invariant tori, while the Fourier expansion modes of X ha (t; a) as a function of a are related to 

5 This definition is slightly different from the one given in [Ge2], where the leaf represents a collection of trees and, as explained 
below, is related to a resummation operation (see also comments in §4.2, item (v), below, and (4.27)), that we do not consider 
here. 
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the sum of the free momenta and of all the internal leaf momenta. This is an important difference: it is a 
property stressed in [Gl] where it is referred as "quasi flatness", source of the main difficulties and interest 
in the theory of homoclinic splitting, see [G1,GGM2,GGM3,G3]. 

4.2. The trees contributions of the examples of §3 will be sums over the various labels of "values" of trees 
decorated by more labels: 

got 

\ j dg Q r Vo {-iu Voj ){iu VoP ){iv Voq ) fl^ e *K„-V„ 0+n „ oV V.„)). (4 .3) 

croc 

• I* (H^Vp) ft e'C-i ») (p Vl r V0 ) I 2 {{-iu V2q ) ft -"'^ ^ ))) (p V2 Tvo ) , 



for (3.3) and 

got 

\ j dg T vo (-iv voj )(in vo ) fl^ e'KA.+n./lr,)) o({-in Vl ) e m ^°^ (4.4) 

(TOO 

0((-in V2 )fl 2 e^-"'^ +n ^ ^)(p V2 T Vl ) 0((-in V3 ) ■^ +n ^°^)(p V3 r Vl )) , 

for (3.5), with the conventions following (3.3) about the dummy integration variables. 

• The integration operations are still fairly involved, as it can be seen from (2.25) and from the expressions 
for J 2 and O. With the above conventions for the dummy variables and noting that, for any function F in 
M, 

I 2 (F(t)) (t) - J" 1 Jo (J( ff0 (t - r)F(r))(t) - I(g (t - r)F(r))(0)) , (4.5) 

we see that the integration over the t v has (by (2.25)) one of the two forms, when p v = 1 and v' is not the 
root (so that j Xv = j v - £), 

(1) l((w M {T v ,)w m {r v )-w m {r v ,)w w (T v ))e^ j Xv = , 

(2) l(g (T v ,-T v )e^'-^+ n ^°^G v (T v ))(T v ,), 0<j Xv <£, 

where G v (t v ) is a function that depends on the structure of the tree formed by the nodes preceding v and 
by the labels attached to the nodes. If p v = it has one of the two forms 

(1) ^(vjlK^Je'^''^^^^"^^))^), jx v =0, 

(2) l{g O T v e^'^ +n ^°^G v (T v ))(0), < j Xv < I . 



• When v' is the root the operations involved in the evaluation of the r„-integral are slightly different if 
j XvQ — j Vo , i.e. if we are considering contributions to the action coordinates, (if j\ = j Vo — i we still 
have integrations of the form (4.6) or (4.7)). If j Xvo = j Vo the integrations are particularly simple if we are 
interested in the evaluation of the splitting vector (1.7), that is j Vo > I and t = 0; in such a case the last 
two of (2.25) are relevant and setting v — vo the integration over t Vq is the value for t v > of 

(1) 1 (w 00 (T v )e^'^ +n ^ a ^G v (r v )) (0) , 

(2) l(e l{ "'- v ^ +n ^ a ^G v {T v )) (0) , 



(4.E 
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(4.9) 



(4.10) 



because, if t v > = 0, one has w«(0) = 1 and wio(0) = 0; see (2.15) and the last two of (2.25). 
More generally, if t v i = t =^ 0, setting v — v n and r — v' , one defines for p Vo = 1 

(1) X{{w m (t v ,)w w (t v ) - W oo(r,0^o£(T„))e 4 ("'-^^+"^°^))a(T,))(T^), h v = , 

(2) I(g (r v , - r v ) e^'^+ n ^°^G v {T v )){T v ,), < j Xv < I , 

(3) 1((w u (t v ,)w 00 (t v ) - ^o(T,0^(r t; ))e i ^'- I/ "^ + "^ ^))G„(T„))(T,0 , j Xv = £ , 

(4) l(e^'^+ n ^ a ^G v (T v ))(r v> ) , j Xv > £ , 

(see the last two relations in (2.25)) and for p Vo = 

(1) W oo(r„0^o£(T,) e ^'-^ + "^°^»G„(T,))(0), j K =0, 

(2) j(g Q r v e^'^+ n ^"^G v (r v ))(0), < j Xv < £ . 

(3) w m (T v ,)l(w M (T v )e^'^+ n ^°^G v (T v ))(0) , j Xv = £ , 

(4) , j Xv > ; 

note that, for t v > =t = and j Xvo > £, (4.9) and (4.10), summed together, give (4.8). 

• Hence each node still describes a rather complicated set of operations: it is, therefore, convenient to 
consider separately the terms that appear in (4.6)-^(4.10). This can be done by simply adding further labels 
at each node. To this end, looking at the integrals in (4.7) and (4.10), at p v = 0, and in (4.6) and (4.9), at 
p v = 1, we see that the following kernels are involved in the integrals 

,,,o ,<_. _ n, _ J ?«oo (tv)wo£ (t,,), v >v ,j v =£ -> j\„ = , 

^ { v ' ' " ' I 9or v , v > v , j v > £ ^ < j Xv < £ , 

!w o(t)w oe (T Vo ), 3v =£, j\ vo =0, 

9 or v j V0 >£ <,, <£, 

we (t)woe(T Vo ), j Vo = £, j Xvq = £ y > 

0, jv > £, jx vo > I , 

i l x / w m (t v ,)w o(t v ) - woo{T v >)w e(T v ), V > V , jv = £ —> jx v = , 

' Tv > - \ 90 (T V ,~ T V ), V > v , j v > £ ^ < j Xv < £ , 

!w e(t)w ao (T Vo ) - w Q0 (t)w M (T Vo ), j Vo = £, j Xvo = , 

9o(t - t Vo ), j V0 >£, < j Xvo < £ , 

W U (t)woo(T VQ ) - Wto(t)w t(T Vo ), j Vo = £, j XvQ = £, 

l, jv > £, j\ vo > £, 

respectively appearing in (4.7) and (4.10), at p v = 0, and in (4.6) and (4.9), at p v = 1. 
The function in (4.11) involving the Wronskian matrix elements can be computed from (2.15) and one 
finds, for instance, that the function in the seventh row on the r.h.s. is 



/ \ / \ i \ i \ 1 \ go(r v '-r v ) smhgoTV sinhgor^ \ 

woe(T v ')w 00 {T v ) - woo(T v ')woe{T v ) = 7; \ — r r 1 r r f ; ( 4 -12) 

2 [cosh g a t v > cosh g a r v cosh g T v cosh g Q T v > J 



hence if we consider (4.6)-^(4.10) we note that the integrals over t v involve functions that can be written, 
for p — p Vl r — t v ,t' = t v ' and for suitable coefficients Cj(p, a, v), (p = 1 if we consider (4.6), (4.9) and 
p = if we consider (4.7), (4.10)), 

2 

]T T^(pr',r)Y^(r',T) Cj (p,a,v) , (4.13) 
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where Y^(t',t) are given, if x = e - a9oT and x' = e" CT9oT ', by 



1 • V, oo oo 

y( " )(T '' T) = 2^? exp[iV(r)] -EE , fc' odd, 



fe'=lfe=-l 

OO oc 



2 cosher k^ti 

OO 

Y^(t',t) = exp[iV(r)] = E^M** , *' = 

fc=0 

which define the coefficients y^\k', fc) for a = —1, 0, 1, 2 (it is easily checked that fc' is odd in the first three 
relations) and we set, for a = —1, 0, 1, 2, 

{go(r' — t) if a is either or 2 and p = 1 , 
g r if a is cither or 2 and p = , (4.15) 

1 if a is either — 1 or 1 . 

Likewise we shall set, defining the coefficients y^"\k', k), for a = —1, 0, 1, and y„ X \k' , k), 

oc oo 

Y^\t',t) = -tanh ffo r'F (a) (T',r) = J E E ^ Q) ( fc '' , a = ±1, fc' = odd , 

fc' = — a fc= a 
oo oo 

Y (0) (r',r) = -tanh 50 T'F (0) (r',r) ^ ^^(fc',^'^ , fc' odd 



fc'=i k=i 



f( 2 )(r',r)=y( 2 (r',r)^Eyi 2 )(0,fc)x fc , (4 ' 16) 



fc=l 

oo oo 



v / oo OO 

F ( V,r) = ^Lexp[m^(r)]^ E E^V^V, fc ' odd ' 

Ti ( V, r) = ff0 (r' - r), f[ 2 \r', r) = 1, f (0) (0, r) = T (0) ; 

in all other cases the T, T, T-functions will be defined 1 (no matter which is the value of the labels that we 
attribute to them: this is done to uniformize the notation. 

The label k will be called the incoming hyperbolic mode and fc' the outgoing hyperbolic mode for reasons 
that become clear by contemplating (4.19) below. 

In terms of (4.14)-f-(4.16) the functions (4.11) multiplied by exp[m„<p°(T„)] can be expressed as in (4.13), 
thus defining implicitly the coefficients Cj(p,a,v) in (4.13): 

< (t v „t v ) expK^r,)] = (^(0,r,)y(0)(r.,r,) + y(-)(^,r,), j Xv = jv -t = 0, 



w° jx (t^vo) exp[m„ <p (T„ )] = < 



' T^(Q,r Va )Y^{t,r Va ) +Y(-V(t,T V0 ), j Xvo =j vo -e = 0, 



Tf\o,T Vo )YW(t,T Vo ), o < j Xvo =j V0 -z<e {A 17) 

f (0) (o,r, )0 )y( )(i,T WO ) + y(- 1 )(t,r t)0 ), j vo =j Xvo =e, 

I 0, jv„ = j\ vo > I, 
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W )a„ ( T *>' ' T « ) eX P \- in v V>° ( T v )] 



Ti (tv , r„) (t„/ , r„) + y« (tv ,t v )+ 

7\ (2) (t„, , r„ ) y ( 2 ) (r„, , r„) , < j A „ = j v - £ < £, 

[ T[°\t,T Va )Y^{t,T Va ) +YW(t,T V0 ) + 

-Y(-V(t,r V0 ), j Xvo =j Vo -£ = o, 

T[ 2) (t, r va )YW(t,r V0 ), < j Xvo = j V0 -£<l, 

f[ a \t,T Va )Y^(t,T Va )+Y^{t 1 T Va ) + 

-Y(-V{t,T V0 )) +Y {1 \t,T v ), j Vo =j Xvo =£, 

K f?\t,T V0 )YM(t,T V0 ), jv =jx vo >£- 



One could avoid introducing the T functions as they are simply related to the T functions or are just 
identically 1: however it is convenient to introduce them to make the above formulae more symmetric and 
therefore easier to keep in mind while working with. 
Finally we define the coefficients £j(k', 0) by the power series expansion 



^7 = J2Mk',0)x' h ' , k'>l, odd , 
fc'=i 

1=0(0,0), j>£, 



cosh g r 



(4.18) 



where x' = e~ agoT and k' is odd, which occurs as coefficient w o(t') in (4.7) (when p v — 0, i.e. v e Ql)- 
The above definitions (taken from (42) and (45) in [Ge2]) suffice to discuss the whiskers (and therefore the 
splitting in the action variables). 

• The (4.13) allow us to introduce a "relatively simple notation": we can add to each node a badge label 
a v = (—1,0, 1,2) that will distinguish which choice we make between the possibilities in (4.14) and (4.16) 
and two hyperbolic mode labels k' vl k v which select which particular term we choose in the sums in (4.14) 
and (4.16); they are integer numbers > —1. We shall not have to introduce labels to distinguish terms 

coming from the expansions of y(«),F(«),y (a) bearing the same badge a because one can check that the 
labels a v together with j v and v itself uniquely determine which choice has to be made. 
In terms of the latter labels we can define a hyperbolic momentum of a line A„ as a label p(v) £ Z which 
will be the sum of all the hyperbolic modes of the nodes that precede v plus the incoming hyperbolic mode 
of the node v itself: this is the sum of the labels k w associated with all free nodes w < v, with v included, 
and of the labels k' w associated with all the free nodes w < v or highest nodes of the leaves w < v, with v 
not included, 

P {v) = k v + + k ' w ) + Yl k '™ ■ ( 4 - 19 ) 

u,<v 

A very important property is that k w + k' w > 0, by (4.14) and (4.16), and k' w > if w £ Ql, by (4.18), so 
that p(v) > — 1. Furthermore if p(v) = then either k v = — 1 and k w + k' w = for all w < v except one 
single node w < v (which is either in df or 9^) for which k^ + k!^ = 1, or k v = and k w + k' w = for all 
w < v. If p{v) = —1 then k v = —1 and k w + k' w = for all w < v. 

• In the above analysis we have not taken explicitly into account the possibility of contributions to <f>'f cr 
coming from the third line in (2.12), i.e. counterterm contributions. They are, of course, possible and they 
can be taken immediately into account in the graphical representation by considering the nodes with a label 
5 V = and adding to them a counterterm label k v , a non negative integer. If k v = this will mean that 
the node represents a contribution from the first line of the definition of <f>'j:' T , i.e. a contribution that is 
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unrelated to the counterterms, while if k v > 1 the node represents a contribution from the term with p = n v 
in the second line contribution to <E>+ cr in (2.12). 

4.3. The trees carry, at this point, quite a few decorating labels and each tree together with all its labels 
will represent a "very simple" contribution to the value of the h-th order coefficient in the Taylor expansion 
in e (at fixed 77 of course) of the S' lcr vector. Very simple means that the improper integrals that correspond 
to each term are very easy to evaluate explicitly and lead to a result that can be expressed as a product of 
factors determined by the labels of the tree and associated with the nodes and with the lines, see (4.30), 
below. We list here the set of labels that have been introduced: 



3v 


action labels 


3K 


branch labels 


S v 


order labels 


Pv 


leaf labels 


U-v 


mode labels 


v{v) 


momentum in the branch At, following v 


fo(w) 


free momentum in the branch X v following v 


a v 


badge labels 


(k v , k v ) 


hyperbolic mode labels 


p(v) 


hyperbolic momentum in the branch X v following v 


K v 


counterterm labels 



There are some constraints between the labels, which follow from the rules stated in §4.1 and §4.2 and 
from the choice of the counterterms (the latter will be discussed in §4.5 below): 

• one has j\ v = j v — I if v < v and j\ v = j v or j\ v = j v — £ if v = v (see the third item in §4.1); 

• if p v = then a v ^ 1, (see (4.17)); 

• if j\ v ^ 0,£, then a v = 2, otherwise if j\ v = 0,£, then a v can be —1,0, 1, (see (4.17)); 

• 6 V = implies j v = £ (by the a-independence of /o); 

• k v ,k' v ,p(v) > —1, (see (4.14), (4.16) and comment following (4.19)); 

• (p(v),Vo(vj) 7^ (0,0), see Remark 4.6 below. 

In terms of such labels, given a decorated tree $0 with mo nodes and with highest node v , we can define 
the value of a subtree $ with to free nodes, highest node w (preceding the highest node vo of $o : w < vq) 
and label j\ w = 0, . . . , £ — 1, p w = 0, 1. It will be given by the expression 



where the integration is the improper integration X (in the sense of §2.4), the tree d consists of a "free" 
rn-nodes tree df with leaves attached to a (possibly empty) subset of the nodes of and the following 
notation has been used. 

(i) The coefficients V v (i}) and C v {&) are described by the collection of labels enumerated above. They can 
be written, respectively, as 




(4.20) 



V„(l?) = 



F f( a ^(o T 1 



T v )e 



X' 



(4.21) 



and 



(4.22) 
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where x v = exp[— ag r v ] and 

y { n v v \T^ ] are (see (4.17)) cither 

• y { nj\ T^ v \ if either v < v or v = v and j A „ = jv - 4 or 

• V ( nJ ] or y ( n} and fj"^ or T^"\ if v = v and j Xvo = j Vo . 

Furthermore p v — 1 if v < w, while p w can be either or 1; j\ w can any value 0, 
any other case j\ v = 0, ...,£— 1 (see above). In (4.21) 



n 



, 2£ — 1 if u> = «o , in 



(4.23) 



depends on the labels (8,v_,j) of the node v and of its predecessors w's (recall that by (2.2) u v = (n v , u v ))\ 
in (4.22) the quantity L. v °,A'd) is called the "value of the leaf" v of order h v (see item (v) below for its 

definition). The matrix J is not, in general, a multiple of the identity and Jo J -1 will be interpreted as 
acting on the rotator components of u v (and it will be 1 when raised to the power 0). 

(ii) For the purposes of the cancellations analysis performed in Appendix A3, the exact form of a few 
coefficients among the VrZ v \k' v , fc„)'s turns out to be essential, so that we list them here: 



/ i; 1) (2,-i) = o 

2/^(0,1) = 0, 
2/^(1,0) = 0, 



,(-!) 



tf^(-l,l) = 
2/^(0,0) 



(1,-1)= a/2, 
a/2, 



y ( n - 1) (l,0) = 2in v , 
y { n}(-l,2) = 2in v , 
y { n](0, 1) = Ain v a . 



(4.24) 



The coefficients y^ —1), y( 1 '(1, 0), 1, 1) and y^ (—1, 2) are equal to the corresponding (i.e. with 

the same values of the labels k' , k) y( a \k', k) coefficients. 

(iii) The value of a leaf with highest node v in (4.22) is not the same as the value L^,-. in [Ge2]: this is 
because of the above mentioned change in notation (see the sixth item in §4.1)). In [Ge2] leaf values are 
defined as sums of the values of all leaves (in the sense we use now) with fixed order, action label and total 
momentum. Then the leaf value considered here, L. v ^ v ^('d), is a single contribution to the L. v ^ v ^ of [Ge2], 
and depends only on the part of the tree d consisting of the nodes w < v; if we call i?„ such a subtree, we 
can write (temporarily, just for the purposes of comparison) the present definition of leaf value as L^,A'&) 

= Lj V °^(i!} v ) (as it depends only on the labels of the subtree In order to make a link between the 
different notations note that in [Gc2] would be, with our present notations, just the sum 



(4.25) 



3v =3 



where i?„ is the part of the tree $ on which the leaf value really depends. 

Coming back to our notations we define the leaf value ($), with j = j Vo , (where the the first and third 
of (4.11) should be used), to be the value of a tree i? with j\ vg = j Vo — I and p Va = 0. 

(iv) By construction (see (2.12) and corresponding comments), and if Ql is the set of highest nodes in the 
leaves, the total perturbation order k of •& is 



k = E Kr » + E Sv + E hv = E Kv + E Sv 

J x — n 



m < 2k . 



(4.26) 



S„=0 



(v) Both the counterterms and the leaf values of a given perturbation order are recursively defined in terms 
of the same quantities with lower orders. In fact r y K (go) admits a graphical representation as sum of tree 
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values defined as in (4.20) with the difference that the integration operation corresponding to the highest 
node of the tree has to be suitably modified (see (4.32) below). 

If Val (#) is defined as in (4.20) (and in item (iv) above) then, by construction, one has 

(t ;c *)= tw^". ~&® = -k\ £ Vaiw ' (4 - 27) 

where is the collection of all trees with total momentum v and order h. In (4.30) mo! -1 is a combinatorial 
factor, which depends on the way we count trees: the simplest is to think that the tree branches of # are 
pairwise distinct and are distinguished by a label 1,2, .. . , mo, if mo is the number of nodes in the tree 
i?. In the latter case, which corresponds to our choice, the factor is simply mo! -1 , see [Gl,Ge2], provided 
we regard as identical two trees that can be overlapped by pivoting the branches entering a node (rigidly, 
together with the subtree attached to them) around any node: as in [Gl] we shall call numbered trees the 
trees so counted. 

4.4. Remark. Since the value of any leaf with highest node v depends only on the labels of the nodes 
w < v, the equation (4.20) factorizes into a product of leaf values times a product of counterterms (whose 
value, so far arbitrary, has still to be specified and it will be, in the analysis between (4.31) and (4.32) when 
intervening compatibility requirements will dictate its value) times a factor 



Pv90T v 



[n / 



d 9 oT v V v (#)][ J] MK,oj\ > ( 4 - 28 ) 



veBi 



which does not depend on the leaves. 



4.5. The extra effort with respect to the approach without counterterms developed in [Gl,Gel], gives here 
(as in [Ge2]) a reward: few combinations of powers of the "times" t v appear in the integrand in (4.21). The 
time variables, by (4.21) and (4.14)^(4.17), appear only via exponentials like 

e -a(r v -r v ,)a QJ . ^ _ ^ e -a(r v -r v ,)a ^ (439) 

for some complex a = (gop(v) — iauj' ■ vo(v)), yielding respectively, upon integration, a -1 or a~ 2 . Note also 
that by the hierarchical structure of the trees one has a (t v > — t v ) > 0. 

This greatly simplifies the actual performance of the integration operations which, once one gets familiarity 
with the formalism, are trivial. One can say that the absence of high powers of the r„'s is due to having 
a priori fixed the Lyapunov exponent go by means of the counterterms (by contrast in [Gl,Gel] arbitrary 
powers of t v appeared because go is not fixed a priori ) . 

Of course the triviality of the integrations is entirely due to the above very fine decomposition, into 
terms identified by labeled trees, of the more compact (but "difficult" to integrate) integrands appearing in 
(4.6)^(4.12) and in the middle terms in (4.14). 

Once all the integration operations will have been performed, the tree value in (4.20) will become a product 
of "factors", in complete analogy with what one is accustomed to find when defining Feynman graphs in 
Field Theory. The factors are associated with the nodes v and with the branches X v . The value of a tree ■& 
will then be defined as 



Val(i9) = e iu} ''" ( v °') t - a9o { k 'vo + p( Vo ^ t \ TT ( ^5 V" 

L ^ gop( v ) — icw' • vq(v)J 



[ n ^(-i^-^rHCMjf n uco)Lj;; (t ,)W][ n 



(4.30) 
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where r v is either 1 or 2, and the case {vo(v ),p(v)) = (0,0) has to be excluded for any node v £ This 
is not to claim that no trees with uq(v) — 0,p(v) = can be drawn by following the above rules: this is 
a further rule to impose on the labels in order that the analysis does not become contradictory requires 
fixing the function j(e,go) conveniently: the consistence criterion determines 7(e, <7o) uniquely. This rule 
is a natural extension of the corresponding rule holding in perturbation theory of KAM tori, which was 
discussed by Lindstedt and Newcomb for the lowest orders of the perturbation expansions and which was 
proved to hold at all orders by Poincare, [P]; see the last paragraph in §2.4 above and the Remark 4.6, (c), 
below. 

The values of the numbers r v arise from the time variables integrals via the mechanism just illustrated 
above (whereby one either gets a -1 or a~ 2 from the integration of the functions (4.29)). 

The factors — (egoY" [goP( v ) — i^' ■ l/ o(v)]~ r " , associated with the branches, will be called propagators or 
small divisors. The first name arises from the possible interpretation of the trees as Feynman graphs of a 
suitable field theory, see [GGM1]; the second name corresponds to the usual name given in Mechanics to 
such expressions generated by perturbation expansions. 

It can be useful to write, if v is the highest node of i? and j\ vo = 0, 

Val(tf) = e ^'-Mv )t-Tg [k' VQ +p(v ))t( a9 ° Y v ° Val(tf) ( 4 .31) 

V g p(v ) - io-lj' ■ v (v Q ) J 

so defining the quantity Val(i9) (this is a well known kind of operation on Feynman graphs, which associates 
with a graph another value gruesomely called the value of the amputated graph - amputated tree in our 
case). Moreover we can define Val(i?) also for (i^q(vq) , p(vo)) = (0,0) as no vanishing denominator appears 
in its expression. It is however clear that nodes with (vo{v), p(vo)) = (0,0) must not appear at all in the 
trees that we consider, for (4.30) to make sense as it is written. This implies, not surprisingly, a consistence 
problem: namely one has to check that the sum of all the Val("$) over trees of a given order and with 
("0(^0)) P( v o)) = (0,0) cancel so that lines A„ with (v (v),p(v)) = (0,0) never appear, neither for v = v 
nor for v < vq. 

The cancellation is made possible because we still have freedom to fix the counterterms and their choice is 
in fact uniquely determined by the conditions that they be such that the needed cancellation takes place. The 
quantities Val(?9) are convenient in order to find and to express the counterterms and also the "resonance 
values" introduced later (see Appendix A3). One checks, see Appendix Al, that the counterterms can be 
explicitly written, if T v ^h is the collection of all trees with total momentum v and order h, as 

lM = ~\ £ VaT(tf), (4.32) 

p(„ ) = 0,„ („ ) = 0, k' VQ =l 

and the * means that the sum is further restricted so that the tree contains no leaves. This choice being 
simply imposed by the requirement that no contribution with (v (vo),p{vo)) = (0,0) can arise for &?(t); 
see Appendix Al. 

4.6. Remarks, (a) The presence of the counterterms will manifest itself not only through the elimination 
of the trees whose value would be meaningless if evaluated via (4.30) but also, and mainly, in the fact that 
the elements of the algebra M. met in the successive integrations have a special form (namely always like one 
of the (4.29)) which implies that the result of the improper integrals is not as complicated as one could fear 
from (2.24). This leads to the simple expression (4.32) (see Appendix Al of [Gl] for what would otherwise 
happen without counterterms). 

(b) From (4.31) one sees that if j\ VQ > I it is natural to collect together the terms with p(v ) = 0: for 
them, since j\ v > £, in (4.30) one must have k' Vo +p(vo) = by the last of (4.14). Note also that by (2.26) 
the case (vo(v),p(vo)) — (0,0) is excluded. If j\ v < i we, likewise, collect the terms with k' Vo + p(vo) = 
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and, for similar reasons the term with k' + p(vo) = — 1 cannot be present (see again (4.14) and (4.16), and 
use p(vq) > —1 supplemented by the relations between the labels p(vo) and k' VQ which will be exhibited in 
§5-1). 

Hence the cases with p(vq) + k' v = —1 are excluded by construction 6 and we see that the sum of the values 
of the trees with p(v ) + k v > o = give us the equations for the actions and the angles of the invariant torus to 
which the whiskers considered are asymptotic: the terms with p(vo) + k v ' o — asymptote to quasi periodic 
functions of cot so that replacing by ij> G T^ 1 one gets a parameterization of the points on the tori in 
terms of a point ip G T 1-1 on a "standard torus". 

And the terms with k' V() +p(vo) = 1 provide the leading corrections. Since such terms are present already to 
order (as one sees from the expression of the pendulum separatrix) the distance between a point moving 
on the stable manifold of the torus and the torus itself will be proportional to x = eT gacrt as at — > oo so 
that go has the interpretation of Lyapunov exponent of the invariant torus; see (2.27) in §2.27. 

(c) Summarizing: the case (v>o(v),p(v)) = (0,0) has to be ruled out as a consequence of (2.26) and of 
(4-32), respectively for the contributions to S^ a and to S+ CT (see the last constraint listed at the beginning 
of §4-3)- All cases with k' Vg + p(v a ) = —1 are also excluded. 



§5. Bounds. 

5.1. We now discuss how to bound the value of a tree or of a sum of a small number of trees which we take 
for simplicity without leaves and without counterterms. The more general case will be eventually reduced, 
see below, to the one we consider here. 

We shall discuss first how to bound values of trees without leaves and counterterms such that p(vo) = if vq 
is the highest node; hence we shall consider trees, always without leaves and counterterms, with p(vo) = 0. 
At the end we shall see how the presence of leaves and counterterms modifies the analysis. 

The following discussion is "locally" simple, but "globally" delicate and repeats that in [Ge2], §4: the 
conclusions are also summarized in the table 0,1,2,3 below. 

From (4.19) it follows that the hyperbolic momentum p(v) is p(v) > —1 and, as remarked after (4.19), 
p(v) = can occur only in special cases: more precisely if p(v) = 0, then k v is either —1 or 0, and 

(1) if k v = 0, all free nodes w preceding v (whether immediately or not) have k' w + k w = 0, while 

(2) if k v — —1, all free nodes w preceding v have k' w + k w = 0, except for a single node w < v such that 
k w + kfi = 1. 

In the latter case we call V the path of nodes (i.e. the totally ordered set of nodes) which connect v to w, 
both extremes included (see also [Ge2], §4). 

• Supposing p(v) = —1,0 and recalling that a v < 2 implies k' v odd, the expansions (4.14) impose that there 
are very few possible choices of the values of the hyperbolic modes at w < v: 

(1) if there is no path or there is a path linking v to w but w ^ w and w ^ V, then k w + k w i = and 
the cases a w = 1, a w = —1 and a w = 2 require, respectively, (k' w ,k w ) = (—1,1), (k' w ,k w ) = (1,-1) and 
(k' wl k w ) — (0,0): correspondingly p{w) — 1, p{w) = —1 and p{w) = 0. While, for w G V, the value of p(w) 
"increases by one unit", i.e. p(w) — 2, p(w) — and p(w) = 1 for neP; 

(2) if w = w then k w + k w i = 1 and the cases a.w = 1, am = — 1 and a-a = 2 require, respectively, 
(*4,&«0 = (-1,2), (*4' fc «0 = (!)°) and ( k 'w^ k w) = (0,1) (correspondingly p(w) = 2, p(w) = and 
p(w) = 1). 

Note that in both cases a w = is not possible. 

The above analysis covers both cases v < vo and v = vo, as the functions in (4.14) and in (4.16) have the 
same dependence on r (hence on k). 



The initial data E hl7 (0,a) were determined precisely by imposing boundedness at at = +oo, i.e. by imposing the absence of 
divergent terms in the expansion in powers of x = e~ ffo CT * which would correspond to the terms with p(vq) = —1. 
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• The latter properties have to be considered as a further restriction to impose on the tree labels, and play 
an essential role for the discussion of the cancellations, [Ge2]. 

• Moreover if p(y) = then: 

(1) if k v — then v can be preceded only by leaves with the highest nodes w having j w > £, because k' w 
must be in such a case, so that the second of (4.18) applies; 

(2) if k v = — 1, then all the leaves again must have the highest node w with j w > £, except at most one leaf 
with highest node w with j w — £ and k!^ = 1. 

(Vertical) Table 0. Possible cases when p(v) = 0,-1. 



p(v) 


k v 




a v 


Jv 


-1 


-1 


odd > 1 


-1 


£ 





-1 


odd > 1 


-1 


£ 








> 1 


-1 


£ 








> o 


2 


> £ 



(Horizontal) Table 1. Cases p(v) — 0, w <£ V. 



{k w , k w ) 
p(w) 

Jw 


-1 

(1,-1) 
-1 

£ 




impossible 
impossible 
impossible 


1 

(-1,1) 
1 

£ 


2 

(0,0) 


> £ 


(Horizontal) Table 2. Cases p(v) = 0, w e V,w > w. 


a w 

(k w , k w ) 
p(w) 

Jw 


-1 

(1,-1) 


£ 




impossible 
impossible 
impossible 


1 

(-1,1) 
2 

£ 


2 

(0,0) 
1 

>£ 


(Horizontal) Table 3. Cases p(v) = 0, w = w. 


a w 

(k w , k w ) 
p(w) 

Jw 


-1 

(1,0) 


£ 




impossible 
impossible 
impossible 


1 

(-1,2) 
2 

£ 


2 

(0,1) 
1 

>£ 



• We extend the definition of path also to the case p(v) — 0, k v — 0, by setting V = if j v > £ and V = v 
if j v = £, only for purposes of notational convenience (see (5.3) below). This is consistent with the above 
tables and does not change them. 

5.2. Remark. Note that, if a tree (or subtree) $o with highest node vq has total hyperbolic momentum 
p(vo) = 0, then there is one and only one path V, and, if V ^ 0, then V connects the node u to some node 
w < v . In fact, if there is a path V ^ 0, the node w is so defined that k w + k!^ = 1: then if k VQ = there 
cannot be any of such nodes (and V = v in such a case), while if k Vo = — 1 there must be one and only one 
such node. This simply follows from the analysis in §5.1, by noting that all nodes w < vq except w must 
have k w + k' w = 0. 

5.3. The small divisors can be really "small" only when p(v) = 0: if p(v) ^ 0, they are bounded by g$ rv , 
i.e. by a quantity of order O(l). So one can consider all free nodes in the trees, among the ones having 
p(v) = 0, which are closest to the root. All free nodes v between them and the root have propagators which 
are not small because \p(v)\ > 1 (see also [Ge2], page 298). 
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Given a tree $o with m free nodes, from each subtree ending in a node «o with p(vo) = (here t>o is some 
node of i9 : it becomes the highest node of one obtains contributions which can be naturally collected 
together (recall Remark 4.4) into a contribution to the tree value (see (4.30)) consisting in a factor 

JlF^Gviu'-MvMv, (5-1) 

V<VQ 

(here the product is over all free nodes preceding v ) times a product of counterterms r y Kv (go) f° r v £ $f 
with 5 V = 0, times a products of factors £j v (k' v , Q)L. v °^ v y for each v G 6^; see (4.30). The vector u (v) is 
the free momentum (defined above; see (4.2)) flowing through the branch A„, the coefficients y' v are related 
to the expansions (4.14) via 

( l 



yi 1 J(-l,f) + yi„ 1) (f,-f)J =a/2 if v G 7>, a v G {-1, 1} , 
(_ i)<5 Q „,-i y^ v v \k v i, k v ) otherwise ; 



where denotes the path in z9 (there is always one such path, possibly the empty set, because we suppose 
p(vo) = 0); and G v \w' ■ vq{v)] is related to the propagator of the branch A„, and it will have the form 



G v [u)' ■ v (v)] = < 



-ogl [gl + («' • ^ («)) 2 ] if vf V, j v = t, , g g) 

[-a ( 5 op( v ) - W • «, ( W ))]-< 2 - 5 -') if v G P, a„ ^ -1, - p(«) ^0 
k go [iw' • ^(«)] 1 if v G P, a„ = -1, 



because: 

(a) The first line is such because if j v > i one has necessarily a v — 2, see Tables 0,1,2,3 and we have to 
integrate a function go(r v i — T v )e lnvip ^ so that k v > 0: hence k v = p(v) = and we have the second 
function in (4.29) to integrate. 

(b) The second line is such because if v V, j v — I we have w\(t v i ,T v )e mijip ^ which is a sum of three 
terms (see the third of (4.17)): the first has k v + k v > > 2 so is excluded (recall that p(vo) = and v < vo); 
while the second only sees the contribution to with k' v = —l,k v = 1, see (4.14), and the third only 
contributes by the term with k' v = 1, k v = —1 in Y^ 1 ^. In the two cases one has p(v) = 1 or p(v) = —1 
respectively; adding up together the latter two contributions and using the first of (5.2) to compute the sum 
of the coefficients we get 



5o - ■ v Q {v) -g {) - iau' ■ u a (v) 2 gl + (u 1 ■ v (v)) 2 

as it can be read from the coefficients in the intermediate column of (4.24) and from p(v) = a v = ±1. 

(c) The third line of (5.3) is obtained by noting that, if v G V, v > w one has p(v) = 1 + k v , so that, if 
j v = I and a v ^ —1, then p(v) > 0, see Table 2; if v = w and a v ^ —1, one has p(v) ^ 0, see Table 3 (note 
that a v ^ 2, so that we have to consider the first integrand in (4.29)). 

If j v > i then a v — 2, and, by the Tables 2,3, one has k v > — 0, k v > and k v + k v > = 1, so that k v = 1 and 
p(v) = 2; while, if v = w, then k v > = 0, k v > and k' v + k v = 1 imply k v — 1, so that p(v) = 1. So in both 
cases p(v) > 1. 

(d) The fourth line is found by looking at the Tables 2,3 as follows: if a v = —l,v G V,v > w, one has 
k v + K = 0, hence k v = — 1, k v i = 1 and p(v) = 0; this happens only if j v = t so that we have to consider 
the first integral in (4.29) and we get the fourth relation. 

This shows that the only trees that do not have a value tending to as t — > coo, i.e. are those with 
p{vo) + K a =0 (all the others tend to as a power of x = e~ 9o<Jt ), have propagators that are even functions 
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of the momenta flowing in them. In fact the observation on the absence of paths preceding vo implies that 
only the first two propagators in (5.3) appear in such trees. Since, as already remarked, the trees with 
p(v ) + k' Vo = give the equations of the tori this is an interesting check that the tori equations so obtained 
at t = +oo and t = — oo do coincide. A similar analysis, and check, holds for the cases j\ < I. 

5.4. Remark. Collecting together the contributions from a v — —1 and a v = 1, for v ^ V, is a convenient 
operation and has nothing to do with the deeper resummations that imply the cancellations necessary for 
convergence estimates: the systematic use of this operation should be described by adding a label to the 
trees on the nodes v V and replacing on the branches which give rise to one of the two propagators in 
(5.4) the a v label by the new label (e.g. a * label which indicates that we consider the sum of the values of 
a tree with a v = 1 and one with a v = —1). We shall do this without explicitly mentioning the new label, 
to simplify the notation. Moreover we can no more associate a label p(v) to a node of this kind, as two 
factors with different p(v) label (p(v) = ±1 for a v = ±1) have been considered together; nevertheless we 

def 

shall modify slightly the definition of p(v) by setting p(v) = 1 in such a case (and letting it unchanged in 
all the other cases). 

We shall continue to call G v [u)' ■ uq(v)\ a propagator as, for the purposes of the following analysis, only 
such modified version of the original propagators appearing in (4.30) plays a role. 

5.5. Furthermore we define the degree D of a propagator to be D = 2 if either v ^ V or v G V, j v > i (hence 
a v ^ —1), and D = 1 otherwise (the constraint, see (4.3), 1 < r v < 2 implies that the power to which the 
divisors appear raised is either 1 or 2); by extension we shall say that a branch A has degree D\ = D if the 
corresponding propagator has degree D. 

The coefficients F„ and y' v in (5.1) satisfy the bounds 

\y' v \<4N, 11 \F^\ < (CN 2 ) m , (5.5) 

v<v 

for some constant C depending on the perturbation /i in (2.1); see (2.6), (2.13) and (2.18). For instance 
one can take 

C = max{| J _1 | Jo, 1} max l/J ; (5.6) 

\n\<N ,\u\<N ~ 

see (4.23), where | J _1 | is the maximum of the matrix elements of the (diagonal) matrix J -1 . 
To bound the product in (5.1), we shall consider simultaneously the cases k Vo = 0, — 1; if k Vo = the path 
V is supposed to be reduced to a single node, v , or to the empty set, 0, depending on the value of j vo , 
(respectively j Vo = £, and j Vo > £, see above). 

What follows below and in Appendix A2 really goes beyond [Ge2], although it constitutes a natural 
extension of it. From now now let us consider the case I = 3 and the Hamiltonian (1.1). 

We shall assume first a condition on the rotation vectors stronger than the Diophantine one, as done in 
[Gl,GG,Ge2], i.e. we suppose that they satisfy a strong Diophantine condition 

(1) Co\u a -v\ > \u\- T , O^eZ 2 , =f 1 -iC( V ) , (5.7) 

(2) min C \u -v\-2 p > 2 n+1 , if n < 0, < \v\ < (2™+ 3 )- 1 /^ 

0>p>n 

where n,p E Z, n < 0, and 

wo = tT 1/2 (^i + V^J^Ai)- 1 ^' = (1, rT 1 ^! + ^J^AiY 1 ^) , (5.8) 

so that <jj' ■ v = ?7 1 / 2 (fi 1 + rji J -1 Ai)u>o ■ v. We suppose also that A\ e [-n~iR, n~iR], with R < JOi/2, 
so that T] 1 / 2 ^! + ■ n ^J- 1 A 1 ) > 7 7 1 / 2 Sli/2. 
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If we write u>' = (r)iCl\ + rjJ^A^j^i^) then the measure of the set of A'^s such that u>q verifies the 
strong Diophantine condition (5.7) has measure of size O^q 1 !]^ 3 ^ 2 ). 

By reasoning as in [GG], once the case of strong Diophantine vectors has been understood, it can be 
extended to cover also the case of the usual (weaker) Diophantine condition (expressed by (1) in (5.7) 
above). Alternatively one could follow the approach in [GM] avoiding completely considering condition 
(2) in (5.7) and assuming only the "usual" condition (1) in (5.7). We shall not perform such an analysis 
(which can be easily adapted from the quoted papers), and we shall confine ourselves to the case of strongly 
Diophantine vectors. 7 

1 _ i 

Keeping in mind that Co = rj^^e +sr) 5 is enormous we shall say that 

(1) G v [uj' ■ ^o(^)] is ° n scale 1, if Co|u> • Vo(v)\ > Co/4, or if p(v) ^ 0; 

(2) G v [u'-v Q {v)] is on scale 0, if 1/2 < Co|o;o • ^o(w)| < C /4; (5.9) 

(3) G v [u>' ■ u (v)\ is on scale n < -1, if 2"- 1 < C |u;o • f (w)| < 2™. 



5.6. Remark. Note that in the above definition of scale the second and third cases can arise only if 
p(v) = 0. The propagators on scale 1 can be bounded by 4 2 if p(v) = and by 1 if p(v) ^ 0. Note also that 
the definition of the scales n = and n = 1 is different from [Gl,Ge2]: this is an important modification, 
exploited in Appendix A2, useful in order to take advantage from the existence of different scale times. 

5.7. As it is well known, (5.1) cannot usefully be bounded by just taking the absolute value of each factor 
and bounding the denominators by using the Diophantine condition. This is true not only if one wants to 
get the improved bounds that we are studying, but also if one, more modestly, wants to show convergence 
for e small enough: this is a problem usually referred to as a " small divisors problem" . 

Useful bounds are nevertheless possible, as shown first in similar cases in [E], because one can collect the 
contributions from the various trees into pairwise disjoint ("non overlapping") classes whose values add 
up to a quantity that verifies much better bounds than the individual elements of the same class. Each 
class will be determined by one of its elements d called a representative. This means that there are 

important cancellations within each class. 

The classes can be constructed by collecting trees which have the same resonance structures. The key 
notion of resonance is recalled below and the description of the classes will follow it. 

Definition. A "cluster" T of scale ut will be a maximal connected set of branches with scales n> nx and 
with at least one branch of scale nx ■ 

A free node v will be defined to be internal to T, v G T, if at least one of the branches leading to it or 
coming from it, i.e. pertaining to v (as defined in §4), belongs to T; a leaf with highest node v' will be 
defined to be internal to the cluster T if v' € T. 

A branch A„ is called external to T if it does not belong to T but it pertains to a node v internal to T, and 
it said to be entering T if the node v' following it is in T, exiting from T if v G T, (note that an external 
branch of T is not any branch outside T). We define the degree Dt of a cluster T to be the degree of its 
exiting branch, and the order Ut of T to be given by the same formula as (4.26), with the extra constraint 
that the nodes are internal to T. 

Definition. A "resonance" V will be a cluster with only two external branches X Vo and X Vl carrying the 



Basically the argument is the following: the analysis that wc present docs not change if 2 P ,2" are replaced by exponentials 
in another base q (larger than 1) or even if they are replaced by j(p),i(n), where ■y(p)/qP p^_ QO ' li an d if m the second 

of (5.7) we substitute 2P , 2 n+1 , 2™+ 3 by, respectively, -y(p),f(n + l),j(n + 3). One then proves a simple arithmetic lemma 
(see [GG]), whereby it follows that, if the first of (5.7) is verified and if ~y(p) is suitably chosen, then the second holds with 
7(p),7(n + l),7(n + 3) replacing 2P, 2 n+1 , 2™+ 3 . 
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same free momentum, vq(vo) = t'o(^i) and with order "not too high", i.e. 



ky < max{iV- 1 2- (llA "o+ 3 )/ T ,( 7 Ar ?? )- 1 } 



def 4fii 



(5.10) 




The branch exiting from a resonance will be called a resonant branch, and the scale n\ V(j of the two branches 
entering and exiting the resonance will be called the resonance-scale. The degree of the propagator of the 
exiting branch will be called the degree Dy of the resonance. 

Even though a node v either with S v — 0, k v > 1 or with S v = 1, v v = is not a cluster in the above sense 
(because it does not consist of branches) we shall nevertheless regard it as a cluster when there are only one 
incoming branch and one exiting branch of equal scale. Therefore we shall also regard it as a resonance, if 



k v < max{7V- 1 2~ ( ™ A "o+ 3 )/ T j ( 7 Ar ?7 )- 1 } when S v = 0,k v > 1 and if 1 < max^-^^o + 3 V T ; (jNn)- 1 } 



when S v = 1, v v = 0; note that the restriction that if S v = = k v there are at least two branches entering 
v implies that no node with 5 V = = k v can be a resonance. 

Definition. A resonance will be called "strong" if p{v ) — p{v\) = 0. 

All resonances on scale < are strong (see Remark 5.8). 

5.8. It is important to note that a strong resonance of degree 2 is necessarily such that also the degree of 
the entering branch must be 2. No branch inside it can be of order 1 and no path can precede t>o- This is so 
because D\ va = 2 implies j Vo > i (see the first of (5.3)) and p(vo) = implies that a Vo = 2, k Vo — 0, k' VQ = 
so any path preceding v would necessarily imply the contradiction p{v a ) = 1. Also if D\ vi — 1, D\ va = 2 
one must have j Vl = t hence k Vl — (otherwise p(vo) > 0) so that k' Vl = 0: but a Vl < 2 and k' Vl must be 
odd. The cases D\ — 1, D\ —1,2 are both allowed. 

5.9. Given a tree let V be a resonance (if there are any) with entering branch X Vl of degree D\ = 2. 
Then consider the family of all trees which can be obtained from $ by detaching the part of the tree having 
X Vl as root branch and reattaching it to all the remaining nodes internal to V but external to the resonances 
contained inside the cluster V (if any); to the just defined set of trees we add all the trees obtained by 
reversing simultaneously the signs of the latter modes of the nodes (this can be done as the sum of the mode 
vectors u w of such nodes, w £ V , vanishes). The set of all the so obtained trees will be denoted JV(i?). 

The definition of resonance and the strong Diophantine condition insures that all the trees so constructed 
have a well defined value (i.e. no division by zero occurs in evaluating it with the above rules); see the 
Remark 5.10, (1), below. 

If the entering branch X Vl of the resonance has degree D\ = 1 then also the exiting branch A„ has degree 
D\ vq = 1, and we collect together with the considered tree also the tree which is obtained from i? through 
the following operation. Replace the resonance V with a single node v carrying labels 6 V = and k v — ky, 
if ky is the order of the resonance. The set of all the so obtained trees will be denoted by JV($): the 
definition of the class JV(^) wm therefore depend on the degree of the branch entering V. 

Then repeat the above operations for all resonances in Thus a class JF(i?) has been constructed and 
the number of elements of J-(-d) is bounded by the product ]J V iMy of the numbers My of branches in 
each resonance V which are not contained inside inner resonances. The latter product is bounded by 
exp J2v 2A/V < exp 2m; the J-(-d) can be obtained starting from any of its elements (which therefore we 
shall call representatives of the class): this is again a consequence of the strong Diophantine condition, see 



5.10. Remarks. (1) The strong Diophantine condition plays a role here that should be stressed. In fact 
one checks that because of it the scale of a branch inside a resonance cannot change too much, as one 
considers the different members of a given family. Not enough to change the sets of branches that belong to 
a given resonance and insures that the different families of trees do not overlap: for this reason the strong 



[Gc2]. 



27 Nov embrel998 



29 



Draft #20 



Diophantinc condition leads to a simplification of the analysis (the simplification in the simpler case of the 
KAM theory). A simplification that is however not major (as explained informally in [Gl] and as shown in 
[GG], see footnote 7 above). 

(2) To see how the above difficulty is bypassed by using the alternative approach of [GM1,GM2], we refer 
to the conclusive comments in [GM1], §3. 

5.11. Consider trees with p(vo) = 0, if vo is the highest node of the tree; then the expression of each tree 
value contains a product like (5.1). As mentioned in the introduction we consider only trees without leaves. 
Since the leaf values factorize with respect the product (5.1), they can be dealt with separately, and no 
overlap arises with the cancellation mechanisms acting on the product (5.1): so that leaves can be easily 
taken into account; see §A3.3 in Appendix A3 (see also [Gl,Gel,Ge2]). 

The counterterms can also be explicitly expanded in terms of tree values, according to (3.2), which again 
we can imagine to have no leaves, (see however the comments in §A3.3 below). 

The cancellation mechanisms described in [Gel,Ge2] (and recalled in Appendix A3) lead to the bound (on 
a given family J 7 ^) described above, in §5.9), see (5.1), (4.30), (4.23) 



,1/2 



2m 



(4iV 3 C') m 2 4m e 2m J] (cl N ~2- 2nN ~)(C^2- nN ~) 

[ n n n 2 (n -™* +3) yi 2 2(n -™* +3) 



n<0 T,n T =n i=l 



i=l 



(5.11) 



where 

• C = max{(2go/^i) 2 , 4 2 }C, with C the dimensionless constant defined in (5.6); 

• m is the number of nodes v > v ; 

• Nl is the number of propagators on scale n and of degree j in which can be written as 



E 



(-1) + J2 m r(«) . 

T 

i n T =n 



(5.12) 



where m 3 T (n) is the number of resonances on scale n and degree j (i.e. with entering branch having a 
propagator of degree j) contained inside the cluster T; 

• the terms N^, j — 1,2, which count the number of propagators which do not correspond to resonant 
branches plus the number of clusters on scale n and of degree j in satisfy the bounds 



(5.13) 



n— — oo j—1 



(with 7 = 4fii/5l2) which are proven in Appendix A2; 

• the first square bracket in (5.11) is the bound on the product of individual elements in the family J-(d) 
times the bound on their number Y[y 27Vy < e 2m , see above. 

• the second square bracket term is the part coming from the maximum principle, (in the form of Schwarz's 
lemma), applied to bound the sums of the tree values ( ll resummations ,, ) over the classes J^ifi) introduced 
above: this is a non trivial product of small factors that arise from the cancellations associated with the 
resummations, see Appendix A3. In (5.11) is the scale of the cluster Vi which is the i-th resonance inside 
T, as in [Ge2]; 

• the rf m l 2 arises as a lower bound on the small divisors of the form u>' ■ v on scale n — 1 (for n — 1 we 
use the better bound \uj • v\ > 2 2 r]^). 
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5.12. Remark. The first bound (5.13) holds for all n and for all Hamiltonians of the form (2.1). On the 
contrary the second bound in (5.13) will follow from the fact that the rotation vector u> has the form (5.8), 
with r\ small, and will be used to control the (huge) factors C in (5.11). 

5.13. Hence by substituting (5.12) and the first of (5.13) into (5.11) we see that, for j = 1,2, the m 3 T (n) 
is taken away by the first factor in 2 ]n 2~° n \ while the remaining 2~ jni are compensated by the —1 before 
the +m 3 T (n) in (5.11) taken from the factors with T = Vi (note that there are always enough — l's), and 
therefore (5.11) is bounded by 

2m 

3/iAm mn4mo8m^8m7Wi| TT -8mA f n2 < "+ 3)/x 



v l/2 



because the product of the factors C in (5.11) can be bounded by using the second of (5.13), since the 
product does not contain the n = 1 factor). The last product in (5.14) is bounded by 

oo 

II 2 - 8m7V " 2( " +3)/T <exp[8miV2 3 / T log2^p2-pH , (5.15) 

n=— oo p— 1 

hence, by adding the remark that the perturbation degree k and the number of tree nodes m are related by 
m < 2k, a bound on the sum over all the subtrees of order k with p(vo) = 0, i/(«o) = v (recalling that the 
number of trees with m nodes is < 4 m m!) is 



E II 4„ G >' ■ M*>)] Vv < BfrT™ , (5.16) 



1 v n ■d'er(i))vei>' 



for some positive constant Bq. The normalization constant |-F(i?)| is introduced in order to avoid overcount- 
ings: in fact if i?' e J r ('&) then -d E Fffi), so that, without dividing by in (5.16), each tree would be 

counted times. 

If Cq 1 = i]~2C(ri) is chosen as in the statement of Theorem 1.4, an explicit calculation gives the bound 
on (5.11) of the form (?y-3) 4fc B§, k > 1, and 

OO 

B = 2 18 (4:N 3 C') exp\2 + 4jNr]logr] + 8sjN^ +8N2 3/T \og2j2p2- p/T ] , (5.17) 

P =i 

which is bounded uniformly in 77 (for 77 < 1). 

5.14. In the previous section trees withp(«o) = have been considered; in particular only the contributions 
(5.1) arising from the value (4.30), once the corresponding tree has been deprived of leaves and counterterms, 
have been bounded and the bound (5.16) has been obtained through a suitable resummation operation. In 
such a case the sum over the labels (k' v , k v ) is trivial because the condition p(vo) = imposes that only a 
few values (up to three per node) can be assumed by the hyperbolic mode labels; also the sum over the 
mode labels v_ v cannot create any problems. In fact for any node v one has \v v \ < N and \n v \ < Nq (see the 
eighth item in §4.1). 

The cases p(vo) ^ as well as those involving graphs containing leaves or counterterms can be treated 
in the same manner as already done in [Gl,Ge2]. We provide, in Appendix A4, a quick description of the 
construction of the analyticity bound e = D^ 1 with 

e^ 1 = D = [B2 e £(2N +l) 2e - 1 (2N + l)] 2 , B = max(Sb»)" 1 I #i) (5.18) 



27 Nov embrel998 



31 



Draft #20 



and B\ is a suitable numerical constant. 

The part of Theorem 1.4 not concerning the connection between the average action A' and the rotation 
vector u) 1 nor the splitting size follows. 

5.15. Determining the exact splitting size (i.e. the leading behavior asymptotically as r\ — > with e < Brj 2 ) 
is not trivial because of the existence of major cancellations in the evaluation of the determinant of the 
splitting matrix; however the analysis in [GGM2] dealt with this question in detail: in the latter paper 
remarkable cancellations are exhibited and an exact formula for the splitting angles is derived (see (7.19) of 
[GGM2]). 

One gets the results in the last item of Theorem 1.4 simply if [GGM2] and the first part of Theorem 1.4 
(to estimate the remainders) are used: then the claimed bounds on the splitting follow immediately (see 
Remark 1.5). In [GGM3] an improvement of lemma 1 and lemma 1' of [CG] was used instead to control the 
density of tori in phase space (the lemmata in [CG] were, as such, useless already in the case in [GGM2] 
because they would require that e be far smaller than the So of Theorem 1.4); see [GGM3], where this is 
discussed in detail and differs from our case only because it relied on a theorem weaker than Theorem 1.4 
above (as the radius of convergence estimate there is proportional to 77 to the power |+ rather than our 2). 

5.16. Remarks. (1) The bound (5.16) and the discussion in §5.14 imply the convergence of the perturbative 
expansions for the parametric equations of the invariant tori (for the Hamiltonian (1.1)), if |e| < £o = 0(ij 2 ). 
This bound on the convergence radius should be compared with the value given by [GGM3], which, for 

gives e = 0(772/ log 2 77 x ). As usual the Lindstedt series gives a much better estimate 
than the classical method (i.e. an exponent 2 versus ~ 4.5). We do not see immediately how to improve 
substantially the classical estimate without important changes in the architecture of the proof of [GGM3] , 
although this should be possible; on the other hand, from the above analysis, £0 = 0(n 2 ) might be close to 
an optimal result. If so it should be no surprise that our analysis is so delicate. 

(2) In the Hamiltonian (1.1), (2.1) the polynomial dependence of the interaction on the rotators angles has 
very likely a purely technical motivation (as it simplifies the analysis) and could probably be relaxed into 
a more general analytical dependence, as in [BCG]. On the contrary the hypothesis that the perturbation 
is a trigonometric polynomial of degree Nq in tp is fundamental to get the correct asymptotic behavior, in 
order to apply the results in [GGM2], where the dominance of Mel'nikov integral is proven provided the 
perturbation is polynomially small in a power of rj Na (so that the results of [GGM2] become meaningless for 
No -» 00). 

(3) A bound of the form (5.16) holds under the weaker condition that 0(77) < e _SI? , with a < 1 (see 
(5.17)). 

(4) If q is defined as in §1.6 so that \e\C(r}) qn < 1 implies analyticity in e, the above analysis gives that q 
can be taken q = 8-fN. 

In general all the bounds found so far are not uniform in N; in order to deal with the analytical case in 
the frame of the exploited formalism one should bound the small divisors by using the results of [GM2] or 
Eliasson-Siegel's bound (see for instance [BGGM]), and use explicitly as in [BCG] the exponential decay in 
v of the Fourier coefficients 

(5) Note that we have convergence for \e\ < 0(f] 2 ), while the asymptoticity of the splitting estimate follows 
only for |e| < 0(rf) (with Q = 2(N + 3)), which much smaller for 77 — > 0. Then the question is: what will 
be the asymptotics for e small enough to be in the convergence domain but too large to be in the domain 
of the asymptotic result? There is some evidence that there is a critical value T c such that, if e — r] T , then 

for T > T c the asymptotic formula that we can prove only for T > Q holds, but for T = T c it is modified 

1 _ 1 

remaining qualitatively of the same size 0(e~^ v 5 ) and for T < T c it becomes qualitatively different. The 
analogy with the critical point scaling phenomena seems to be substantial and, keeping in mind that the 
above theory can be interpreted as a field theory, see [GGM1], one would say that the region T > T c is 
described by a trivial fixed point; a non trivial fixed point describes the case T = T c and another "low 
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temperature" fixed point describes the cases T < T c . Evidence in this direction comes also from the theory 
of the standard map and its developments, [La,Gel2,Gell]. 
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Appendix Al. Counterterms 

Al.l To order h one can write, by using (2.12), (2.25) and (4.11), 

t 

Z h - a (t) = j dTwl(t,T)($ h + ° (r; 7l ( ffo ),...,7h-i(.go))+7h(.9o)sin^ (r)) (Al.l) 

(TOO 



+ j dr «;§(*, r) ($f (T;7 1 (.go),...,7ft-iC9o))+7ft(3o)sin^ (r)) , 

(TOO 

where $^ CT (t; 71(30), Ih-i(go)) takes into account all the contributions to <^ h ^ r = (Jog , o) _1 -P+ CT except 
the only one explicitly depending on 7/1(30), which is given by r fh(ga) smip°(T). 

We shall impose, recursively, that the contributions to the integrands in (Al.l) arising from amputated 
trees (see comments after (4.31)) with (v (v ) , p(v a )) = (0,0), without leaves and without end nodes bearing 
a counterterm label (see Remark A1.2 below), compensate exactly the contributions due to the trees with a 
single node representing a counterterm of order h (i.e. the terms with jh(go) in (Al.l)). The first described 
type of contributions can be written 

00 

W 1 =wj(t,T)^(T; 7l (g ),...,'y h . 1 (g ))= ]T £ Wi(i/,p) e*"'-"-™"" (A1.2) 

„ eZ <-ip=-l 

in the first integral in (Al.l) and 

00 

Wo = w e (t,T)^(r; 7l (go),...,jh-i(go))= ]T ]T Wo(v,p) e™'^-™^ (A1.3) 

p=-i-veZ i - 1 

in the second one. Imposing that such contributions are canceled by the contributions with p = arising 
from w\(t, T)jh(go) sin^3 (r) and w®(t, T)^fh(go) sin<p°(T) gives our prescription on how to fix r fh(ga)- 
Since two integrals are involved (one for p Va = 1 and one for p Vo = 0), the first time dependent and 
the second time independent, two conditions may seem to be required: however note that sin<^°(£) = 
2 sinh got/ cosh 2 got is expanded in odd powers of x = e -0 " 90 *, hence the only terms appearing in (A1.2) and 
(A1.3) which can contribute to p = are, in both cases, those involving y^~ 1 '(k' VQ , k Vo ), with k Vo = — 1. This 
means that the contributions with p = arising from (A1.2) and (A1.3) are equal, so that no compatibility 
problem arises. 

Note that an expression analogous to (Al.l) is obtained also for S+ CT (i); however the terms with p = 
have the same form as in (Al.l) (sec (4.17) for j\ VQ — I and for j\ VQ = 0), so that if no term with 
(vo(vq) , p(vo) — (0,0)) contributes to E^f(t), then also no term with (v (vo),p(vo)) — (0,0) contributes to 
Hf(t). 
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A1.2. Remark. It may seem strange that we exclude from the definition of the counterterms trees with 
leaves: in fact one can imagine to realize trees with (v(vo),p(vo)) = (0,0) also by trees which contain 
leaves with a stalk bearing a label j w > I or j w — I and internal momenta (v>',p'). Such terms would 
give rise to a-dependent counterterms which of course are not allowed: however it turns out that the sum 
over all contributions to tree values of trees with (v(vo),p(vo)) = (0,0) from such trees cancel exactly: this 
is explained, together with the other cancellations built in our algorithm, in Appendix A3 (see §A3.5 in 
particular). 

A1.3. Let us consider the first integral in (Al.l). Corresponding to the node v$ of each tree whose value 
contributes to E^i) there is a coefficient y nvQ (k' VQ , k Vo ), see (4.14), (4.17). Then from (Al.l) and the just 
formulated condition to impose we obtain 

J2 Val(tf) +fh(go) w\{t,r) sin^o(r)| fe , =1;p=0 = , (41.4) 

p(vo)=o,k' Vg = i 

where the sum is over the set T 0y h of all trees of order h and momentum v(vo) = 0, with vo(vo) = (see 
Remark A1.2), p(v ) = 0, j Vo = I and k' Vo = 1; hence if p(vo) = 0,j Vo = one must have k VQ = —1, hence 
a Vo = — 1 and k' Vg = 1 which is a possible case indeed. 

A trivial calculation (just take into account that 2/n~^(l, —1) = er/2 and sin<£ (r) = Aax + 0(x 3 )) gives 

wj(t,T) sin^o(r)| fc/=lp=0 = 2 , (41.5) 
so that (4.32) follows; the above follows [Ge2], page 287. 



Appendix A2. (Improved) resonant Siegel-Bryuno's bound 

A2.1. We follow the idea of Poschel, [P6] (see also [Gl, GG,Ge2]). In the discussion, we focus on the scale 
labels, so that it is quite irrelevant which value the p(v)'s, v e assume, and therefore which resonances 
are strong and which are not. 

Calling N*("&) the number of non resonant branches carrying a scale label < n, in a tree t? with m nodes, 
we shall prove first that 

N*(d) < 2mE n - 1 , E n d = f N2 < - 3+n ^ T , n < 1 , (42.1) 
provided that N*(d) > 0, and 

JV *(i)) < 2m 7 A?7 - 1 , 7 = / 4Qi/n 2 , (42.2) 

if Ar *(tf) > 0. 

Define, as in §5.7, u = (l.jy- 1 ^ + r]i J-^i)" 1 ^)- Then C |u;o • v\ > \u\- T for all ^ v G Z^ 1 ; see 
(5.7). Assume also r\ so small that Co > 2, (this is not restrictive as we are interested in r\ — > 0). 

Set E n = N2 ( - n+3 ^ T as in (A2.1). Note that if m < E- 1 one has N*(d) = 0. In fact m < E- 1 implies 
that, for all v £ i9, \v (v)\ < NE- 1 , i.e. C a \u>o ■ M^)\ > (N- 1 E n ) r = 2"+ 3 , so that there are no clusters T 
with tit = n. Note also that if m < (jNr]) -1 , with 7 = 4f2i/f2 2 , then = 0, as \lj • Vo(v)\ > 1 for all 
v E 1? in such a case. 

A2.2. Let us prove first the inequality (A2.1). If $ has the root branch either with scale > n, or with scale 
< n and resonant, then calling #1, $2, • • • , $k the subtrees of 1? ending into the highest node vq of 1? and with 
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rrij > E n 1 nodes, j = 1, . . . , k, one has N*($) = iV*(i?i) + . . . + N*($k) and the statement is inductively 
implied from its validity for ml < m and from the just proved fact that N*("&) = if m < E~ x . 

If the root branch is on scale < n and non resonant, one has iV*(i?) < 1 + J2i=i ^n(^«) : if = the 
statement is trivial, if k > 2 the statement is again inductively implied by its validity for m! < m. If k = 1 
one has N*(i3) < 1 + 2m\E n — 1, hence we once more have a trivial case unless the order mi of i?i is 
mi > m — (2£7„) _1 : but in the latter case we shall show that the root branch of i?i has scale > n. 

Accepting the last statement (which will be proved below), one will obtain Af*(i?) = 1 + N*($i) = 1 + 
N*($[) + . . . + N*(i)' k ,), where i?^.'s are the k' subtrees ending into the highest node of with orders 
m'j > E~ , j = 1, . . . , k' . Going once more through the analysis the only non trivial case is if k' = 1 with 
the root branch of ■&[ non resonant; and in such case N*^) = N*($'{) + . . . + N n ("&'£„), etc., until we reach 
a trivial case or a tree of order < m — (2E n )~ 1 . 

It remains to check that if m — mi < (2E n ) then the root branch of di has scale > n. Let us proceed by 
reductio ad absurdum. Suppose that the root branch of $i is on scale < n. Then Co|u>o • i / o(vq)\ < 2™ and 
Co|w • fo(ui)| < 2", if v\ is the highest node of Hence Cq\u)q ■ (t'o(w) — Vo{ v i))\ < 2 n+1 (equality would 
imply violation of the strong Diophantinc property, (5.7)), and the Diophantine condition implies that 

M«b) - > 2-<" +1 )/ T = 5 , L42.3) 

because t'o(uo) "oC^i) (the root branch of $ being supposed non resonant). But m — mi < (2E n )~ 1 , so 
that |i/ (u ) - "o(vi)\ < (2E n )~ 1 N < 2- 1 2- {n+ ^l T = 2- {1+2 l T ^8 < S, which contradicts inequality (A2.3). 

A2.3. Let us prove now (A2.2). If $ has the root branch cither with scale 1, or with scale < and resonant, 
then calling $2, • • • , $k the subtrees of $ ending into the highest node v n of •& and with mj > (jNrj)^ 1 
nodes, j = 1, . . . , k, one has Nq ($) = JVq + . . . + ATq ($fc) and the statement is inductively implied from 
its validity for ml < m and from the fact that Nq (1?) = if m < (7.W77) . 

If the root branch is on scale < and non resonant, one has Nq($) < 1 + X)i=i -^0 (^«) : if fe = the 
statement is trivial, if k > 2 the statement is again inductively implied by its validity for any m! < m. If 
k = 1 we once more have a trivial case unless the order mi of #1 is mi > m — (2 r yNr])~ 1 : but in the latter 
case the root branch of $1 has scale 1. 

Accepting the last statement (which will be proved below), one will obtain Nq($) = 1 + Nq ($1) = 1 + 
Nq (f9'i) + . . . + Nq (i9' k i), where i?^'s are the fc' subtrees ending into the highest node of with orders 
> (2jNr])^ 1 . Going once more through the analysis the only non trivial case is if k' = 1 and in that 
case Nq = Nq ($'{) + . . . + N ('& k „), eic -' untu we reach a trivial case or a tree of order <m — (2"/Nrj)^ 1 . 

It remains to check that, if m — mi < (2jNr])^ 1 , then the root branch of $1 has scale 1. Suppose that the 
root branch of i?i is on scale < 0. Then p(v\) ^ and |o; ■ vo(vo)\ < 1/4, |u>o ■ fo(^i)| < 1/4, if «i is the 
highest node of $1, i.e. 

|«o-N(«d)-^(«i))| < 1/2. (A2.4) 

As the root branch of $ is supposed non resonant, then m — mi < (27^77)^ implies that < \vq(vo) — fo(i>i)| 
< (2jNi])~ 1 N = (2-fr])^ 1 , so that one would have |u>o • (^(^0) — "( u i))l > 1) which is contradictory with 
the inequality (A2.4). 

A2.4. A similar induction can be used to prove that if the number of branches on scale n is A^„(i?) > 
then the number p n {"&) of clusters of scale n verifies the bound 

PnW < 2m^2 (n+3)/r - 1 . (A2.5) 

In fact this is true for m < -E^ 1 , if £"« is defined as in §A2.1. Otherwise, if the highest tree node v is not in a 
cluster on scale n, one calls . . . ,i?fc the subtrees ending into Vo, and one has p n {$) = Pn{$i) + - ■ ■+Pn('&k), 
so that the statement follows by induction. If vq is in a cluster V of scale n, and . . ., are the subtrees 
entering the cluster containing t> and with orders mj > E~ x , one will find p n {$) — l+p„(i?i) + . . -+p n (^fc)- 
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Again we can assume that k = 1, the other cases being trivial. But in such case there will be only one 
branch entering the cluster V and it will have a propagator of scale < n — 1. Therefore the cluster V must 
contain at least E~ x nodes. This means that mi < to — (2E n )~ 1 . 
Finally, the bound 

o 

Pn(#) < ZmjNri - 1 (A2.6) 

n— — oo 

is a trivial consequence of (A2.2). 

A2.5. Let N* < N* be the number of non resonant branches on scale n. Then if N n is the number of non 
resonant branches plus the number of clusters on scale n, N* verifies the bounds 

K = (N* + Pn) -Pn = N n - Pn < 4mA2("+ 3 )/ T - 2 - ^ (1) < 4mN2 ( - n+3 ^ T + ^ (-1) . (A2.7) 

T T 
n T =n n T =n 

This proves that (A2.1) and (A2.5) imply an inequality analogous to the first of (5.13); likewise one derives 
an inequality similar to the second of (5.13) by combining (A2.2) and (A2.6). 



Appendix A3. Cancellations between resonances 

In this appendix we recall briefly the cancellation mechanisms of [Ge2]. We provide this as a guide to the 
reader and as a tune up of a fine points of the analysis of [Ge2] (the analysis in A3. 2 is given here in full 
details while in [Ge2] it was left out). 

A3.1. Consider a tree $ with a strong resonance V of order ky. Let A„ and X Vl be, respectively, the exiting 
and entering branches of V. There are two possibilities: either the degree of the propagator corresponding 
to the branch exiting from V is D\ vo — 2 or it is D\ vq — 1 (equivalently the degree of the resonance is 
cither Dy = 2 or Dy = 1). 

Let us discuss first the case in which the degree Dy of the resonance is Dy = 2. Then j Vo > I (see (5.3) 
and the comments after the definition of strong resonance in §5.9) and, by following the notations of §5.1, 
we shall say that V — 0, i.e. there is no path V ending into vq. It follows, from the properties of V discussed 
at the beginning of §5.1 above, that p(v\) = implies j = j Vl > I and D\ = 2 (see again (5.3)). 

Consider all the trees belonging to the class J-{{>) which are obtained from -d by detaching the subtree 
having as branch root the entering branch of the resonance and attaching it to all the remaining nodes 
of V (see the definition of the class ^V(i?) in §5.9). As a consequence of such an operation 

• some of the branches internal to the resonance have changed the free momentum by an amount uq{v\ ), 
and 

• if w is the node inside V to which the branch X Vl is attached and j Vl — I > 0, then F E (see (4.23)), has 
the form of an even function of u w times a factor (iv w j). 

We shall call resonance value TZy the product of factors appearing in the definition of tree value and relative 
only to the nodes and branches internal to the resonance V: 

Kv - K vo y' Vo J] F^VvGvW ■ Mv)} , (43.1) 

V<VQ 

and shall consider the resonance value as a function of the quantity \x = u>o ■ vq{v\). 

Then for fi = a quantity proportional to ^2 w£V v W j is constructed, but such a quantity is vanishing by 
definition of resonance, as j = j Vl — £ > 0. 
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If we sum also on an overall change of signs of the mode labels of the nodes internal to the resonance (by 
following the definition of the class T{d) given in §5.9), we obtain a zero contribution also to first order in 
fi (here the even parity of the perturbation / is essential, see [Gl,Ge2]). 

This can be seen by using the explicit form of the functions in (4.21), i.e. the coefficients listed in (4.24). 
Noting that in the present case there cannot be any V inside V the only propagators we can associate with 
the branches internal to V have the form of the two first terms of (5.3), so that, for = 0, they are even 
functions of the mode labels. Moreover in such a case the analysis in §5.1 shows that a v = — 1, a v = 1 and 
a v — 2 imply, respectively, k' v = —k v = 1, k' v = —k v = —1 and k' v = —k v = (the case a v = is not 
possible here): then no n v labels appear in the coefficients yn v v \k' v ,k v ) corresponding to the nodes v < vq 
(see the list of coefficients in (4.24)). Therefore all the dependence on the n v labels is through the factors 
F v in (4.23). This yields that there is an even number of the n v (if there are any) corresponding to the 
nodes v eV: two for each branch A„ with j v = £, by taking into account that j Vo ,j Vl > £, so that no change 
is produced by the sign reversal (since, by the parity properties of the Hamiltonians (1.1) and (2.1), one has 
also /*" = ). This means that the resonance value is an even function of jj,. 

A3. 2. Let us now consider the case in which the strong resonance is of degree Dy — 1 and the tree i? has 
no leaves inside V. In such a case a Vo = —1 and j Vo = £, hence D\ va — 1 (see (5.3)): then a first order zero 
in /i will be enough. Moreover there is a V inside the resonance: we shall distinguish between the cases 
v\ £ V and v\ G V. 

Let us consider first the case v\^V (in particular this is the case when V = v , k Vo = 0, provided ky > 2). 
In such a case j Vl > £ and we can reason as above to obtain a first order zero. Note that in such a case 
there would be no cancellations between tree values of trees obtained by the sign reversal operation. 

On the contrary, if V\ € V, then k Vo = — 1, and one has also a Vl = — 1 and j Vl = £. In this case consider 
together with the tree $ also the tree obtained from $ by performing the following operation (recall the 
definition of J-y {'&)): replace the resonance V with a single node v carrying labels 6 V — and k v — ky (if 
ky is the order of the resonance V), then express the counterterm 7 K „(go) associated with the node v in 
terms of trees. If i?i is the subtree having X Vl as root branch, then the values of the two considered trees 
i? and i?' can be written, respectively, as Val(i?) = A(d)Ky Val(i?i) and Val(tf') = A{d)Yy kv {g )(T /2]V&\(di), 
where a/2 = yi~ x \\, —1) and A($) takes into account the factors corresponding to all nodes not preceding 
v , and has the same value for both $ and . 

The resonance value IZy, for \i — 0, can be written as 

IZy — Val(i9o) in v ' % , for some $o S %.k with p(vo) — 0, k Va = —1 , (A3. 2) 

see the definitions (4.30) and (4.31) of tree value and the definition (A3.1) of resonance value: remember 
that we are considering resonances V with degree Dy = 1, so that k Vo = — 1 and, as a consequence, k' VQ > 1; 
see (4.14). The counterterm 7 K (go) can be represented in terms of trees as in (4.32); note that, if the tree 
contributing to 7«(<?o) has k Vo = —1, the condition a Vo = —1 implies that such a tree has a node w > «o 
with k w + k' w — 1, while all the other nodes v ^ w have k v + k' v = 0. 

Among the contributions in (4.32) to 7^ v (go) there will be a quantity Val(??2), where t?2 will have the same 
topological form of i9o in (A3. 2) with the node w such that k w + k' w = 1 corresponding to the node v[ E V; 
then we denote both nodes by w. 

Then VaI(i?o) will be related to Val(tf 2 ) by 



Val(t? 2 



ybv ^ (k' w , k w ) \k' w +k u 

Unj\k' wl k w )\k> +k u 



Val(^o) , (A3.3) 



so that a look at the coefficients listed in and after (4.24) shows that the factor in square brackets in (A3. 3) 
(when it is not vanishing) is equal to 4in w a. The quantity Val(i?2), in order to contribute to 7fc v (<7o)c/2, 
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has to be multiplied by a factor — 4c extra with respect to Val(i9o), which, on the other hand, has to be 
multiplied by in w in order to contribute to the resonance value TZy (see (4.32). 
Then, for n = 0, by summing the values of the two considered contributions one obtain 

Val(t?i) , (A3.4) 
which is zero by (A3. 3), so that a first order zero is obtained. 

A3. 3. If there are leaves, nothing changes in the discussion of §A3.1, as k Vo = implies that only leaves w 
with j w > £ are possible, and £ w (k' w ,0) = 1 in such a case (see (4.18)). 

In §A3.2, when discussing the case v\ € V, one has to take care of the case in which there is a leaf with 
highest node w with k'^ = 1 (such a leaf will be at the end of the path V) . In fact the resonances having 
as entering branch a branch of the path V cannot have any leaves with k' w — 1, while when considering 
the graphical representation for j K (go), there will be also contributions arising from trees containing a leaf: 
such contributions will be either of the form (4.32) with Val(i?2) = Val(f?2)wij^£ui (1, 0)2^^ ($2), or of the 
form 7k(<?o) = 7k-?h (<7o) (1, 0)L^f s ($2), where hi > 1, and "d' 2 is a suitable tree of order k — hi. 

Then one realizes that the two contributions cancel exactly, so that no new case has to be discussed with 
respect to the analysis of §A3.2. 

A3. 4. The above discussion completes the proof of approximate cancellations of resonance values (i.e. of 
cancellations to first and second order, according to the degree of the resonant branch). The existence of 
cancellations, approximate to the first or second order, is all is needed to obtain the bound (5.16): the 
analysis continues exactly as in [Ge2] and is based on simple analyticity arguments that allow us to exploit, 
via the maximum principle, the fact that in a resonance with momentum v the functions of /i — ujq ■ u that 
have been considered above have a zero in fj. of order 1 or 2. 

A complete analysis showing that the higher orders contributions (i.e. the part which does not cancel) can 
be performed as in [Ge2], Appendix B, and the final result is given by the bound (5.16) in §5.11. 

A3. 5. We shall show now that all contributions with (vo(vo) , p(vo)) = (0,0) involved in the definition of 
the counterterms (see Appendix Al) must have automatically also v(v ) = 0. The analysis performed in 
§5.1 shows that in order to have p(vq) = (for j Vo = £), there can be any number of leaves with highest 
nodes w such that j w > £ and only one leaf w with j w = £ (contributing, respectively, k' w = and k' w = 1 
to p(v )). 

Each time a leaf with j w > £ appears, if we sum together the values of all trees obtained by detaching 
the leaf with its stalk, then reattaching it to all the other nodes of 1?/ , we obtain a vanishing contribution: 
simply by the cancellation mechanism described in §A3.1 (assuring there the first order zero), which, now, 
is an exact cancellation as the leaf does not contribute to the free momenta of the branches 0/1?/, so that it 
does not modify the propagators. So we can suppose that no leaf with j w > £ is possible in trees involved in 
the determination of the counterterms. 

In the same way, if we have a tree $ having a leaf with j w — £ and h w = h — hi (for some hi), we can 
reason as in §A3.2 and consider, together with 1?, also the tree formed by only one free node, carrying a 
counterterm label hi and bearing the same leaf as The same cancellation mechanism described in §A3.2 
apply now: again the only difference is that now the cancellation is exact (by the same reason as before). 

This shows that no tree with leaves can contribute to (vo(vo),p(vo)) = (0,0), so that for such trees one 
has v(vq) = vq(vo) = 0. This, together with the analysis in §A1.1, proves (A1.4) in §A1.2. 



A(d) V&\(tf Q )in w -— Val(0 2 ) 
L 4(T 
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Appendix A4. Graphs with non zero total hyperbolic momentum, with leaves 
or with counterterms 

A4.1 Consider first the cases p(i>o) 7^ 0. In this case we consider the nodes w < vo with p(w) — and which 
are the nearest to vq: by construction all nodes z between vq and the just singled out nodes have p(z) =/= 0. 
Let us denote by $ the set of such nodes z and k\ the sum of their order labels. The subtrees having as 
root branches the branches exiting from the nodes w can be considered as trees of the kind of the previous 
sections (i.e. with p(w) = 0), so that the integrations corresponding to their nodes can be performed and 
discussed as before and a bound B 2ka v^ 2ka follows, if fco = k — k\. 

All the other nodes (in d) have p(z) ^ 0, so that no real small divisor appears (i.e. the propagators are 
trivially bounded by 1). The only delicate point to discuss concerns the sum over the hyperbolic mode 
labels, but this can be done as in [Gc2], p. 292, (or in [Gl], item 7 in Appendix Al), to which we refer for 
details beyond the summary that follows. 

By noting that the Laurent expansion of each function (of x and x') appearing in (4.14) and (4.16) starts 
from k > — 1 and k' >= 1, we can denote by Mi the maximum of all such functions (multiplied by l/x and 
1/x' respectively when k = — 1 and k' = — 1) in a disk of radius A = 1/2. If M 2 is the coefficient of the term 
with k v = -1 or k' v = -1, set M = max{Mi, M 2 }. 

Consider the tree value (4.20). If at < 1, the first integral (corresponding to the highest node u ) can be 
split into the sum of two integral, the first one from aoo to al (here the value 1 is an arbitrarily chosen 
positive number) and the second one from al to t. Let us denote by 7 TO (i?) the first integral and J m (^) the 
second one, if m is the number of nodes in i9. 

For the nodes v E one has 

(1) for each node the associated propagator is bounded by 1 (as p(v) ^ 0), 

(2) n^i^j<(civ 2 ) 2fci , 

(3) for each node v one has \yk" v) (k v , k' v )\ < M2 k - +k '-, 

(4) the last integration (on t Vq ) produces a factor exp[— k' Vo + p(vq)] = exp[— ^2 vG $(k v + k' v )]. 

Then the sum over the hyperbolic mode labels can be performed and gives, for each node v E a factor 
A 2 , where 

' 2 s 7 ' 



fc= 



^ej 2(e-2) 



(A4.1) 



The contribution to 7 m (t?) arising from $ a bound B 2kl is obtained, with B\ = A 2 CN 2 M, so that, for I m (z?) 
a bound 

B 2k , B > max{£ ?r\£i} (-44.2) 
is obtained (see (5.6) for the meaning of C). 

By taking into account the integral J m (d), one can perform a splitting of the integration domain for the 
integrals corresponding to the nodes immediately preceding vq (now for each such nodes the second integral 
is from al to t Vq ), and, iterating such a splitting, one finds that Val (■&) can be written as sum of at most 
2 m terms each of which has the form (for some integer p) 



n 



dr v (. . .) 



^(^•••^(tf,), (-44.3) 



where #1, . . . ,i? p arc disjoint subtress of $ and i9* is the set of the mo nodes in $ not belonging to any 
such subtrees and mi + . . . + m p + m = m. The dots between the parentheses denote the product of the 
functions in 

\{Y^\t' w ,t w ) (AAA) 

we'd 
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which depend on t v , for a given v G and therefore is a quantity bounded by 1 (see (4.14) and (4.16)). 
Note that the functions Y have no singularity as functions of their arguments x = e a9T , x' = e agT , when 
x, x' = 1 (or t, t' = 0), even though the values x, x' — 1 lie on the convergence circle (the singularities being 
at x, x 1 — ±i). Furthermore each integration from crl to iv, once the integrand has been bounded, gives 1, 
while the integrals I m (tij), j — 1, • • • ,p can be bounded as before. 

Of course, if at > 1, the discussion is easier as no splitting of the integration domains is needed. 
So we can conclude that a final bound (2B) 2k is obtained for Val ($); so far neither leaves nor counterterms 
have been considered. 

A4.2 Introducing the leaves and the counterterms, one sees (recall Remark 4.4) that the value of any tree 
•& can be always be written as the product of a factor like (4.28) times the product of the counterterms and 
of the leaf values; each counterterm can be decomposed in turn as sum of values of amputated trees (see 
(4.32)). As each leaf and each amputated tree can contain other leaves and counterterms we can iterate 
such a decomposition procedure, until, at the end, the value of the tree with highest node vq, turns out 
to be given by the product of factorizing terms which 

(1) either are of the form (4.28), with p w = for any subtree with highest node w < v and with p w = 0, 1 
if w = v , 

(2) or differ from (4.28) simply because no integration is performed corresponding to the highest node. 
The terms as in item (1) correspond to subtrees contributing to leaf values (for w < Vq) and to Val (■&) itself 

(for w — v ), while the terms as in item (2) correspond to amputated trees contributing to counterterms. 
Then a natural decomposition of the tree •& into subtrees •& (amputated or not) follows: each of such subtree 
contains neither counterterms nor leaves (by construction) . Furthermore the subtrees contributing to leaves 
are linked to nodes of some other subtrees through their stalks, while the amputated subtrees are not linked 
to any node (as there is no branch exiting from the highest node). To keep memory of the node to which 
the counterterm label is attached we can draw a hatched line connecting the amputated subtree to such a 
node. 

So for each subtree t? (amputated or not) one can reason as above and a bound B m ° is obtained, if B 
is the same constant as before and if mo < 2fco is the number of free nodes of the subtree. For all of 
them the resummation described in §5.9 has to be performed, to bound the values of the subtrees $ v , with 
p(v) = 0, v G v 1 : such a resummation is taken into account by the constant B. By collecting together all 
bounds one obtains, for the (normalized) sum of the values of the all trees generated by the resummations 
corresponding to the families J 7 a bound B 2k , if k is the order of ■&. 

Therefore we are left with the sum of all possible ways to arrange leaves and counterterms. The choice of 
the leaves is uniquely determined by the assignments of the labels p v , v G so that it gives a factor 2 m 
(recall that the number of nodes m is such that m < 2k). 

In the same way one can deal with the counterterms: simply one has to distinguish between solid and 
hatched lines, so that another factor 2 m is produced. 

A4.3 Then the sum over all the other labels can be performed, in the same way as for the contributions 
without leaves and without counterterms (see the beginning of this subsection) . The sum over the hyperbolic 
modes has been taken into account by the constant B (see (A4.2)); moreover 

• the sum over the mode labels is bounded by (2N + l) m ^-^(2N a + l) m , 

• the sum over the angle labels is bounded by £ m , 

• the sum over the order labels is bounded by 2 m , 

• the sum over the badge labels is bounded by 2 m . 

Therefore, by taking into account that the momenta and the hyperbolic momenta are uniquely determined 
by the mode labels and, respectively, the hyperbolic mode labels and that the sums over the leaf labels 
and counterterms labels have been already considered, we are left with the sum of (unlabeled) numbered 
trees (see comments after (4.27)): but these are no more than 2 2m m\, so that, both for X^(t) and jk(go), 
a final bound D k is obtained, for some constant D: in terms of B the constant D is given by D = 
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B2 6 i (27V +l) (e ~^ (27V + 1), i.e. (5.18). In particular one has that D is proportional to r\~ 2 , as B is so. 
Note that, as a matter of fact, we have bounded S^(t) in (4.27) by neglecting the constraint on j and v. 
Therefore, by making use of the fact that the Fourier coefficients with \v\ > h(N + No) vanish at order h 
as a consequence of the trigonometric assumption on the perturbation /i, see (2.2), the bound (2B) 2k is a 
bound both for the Fourier coefficients of X a {t; ct) and for the function X a (t; a) itself. 
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